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This is a compact introduction to some of the principal topics of 
mathematical logic, [n the belief that beginners should be exposed to the 
easiest and most natural proofs, I have used free-swinging set-theoretic 
methods. The significance of a demand for constructive proofs can be 
evaluated only after a certain amount of experience with mathematical logic 
has been obtained. If we are to be expelled from ‘Cantor’s paradise’ (as non- 
constructive set theory was called by Hilbert), at least we should know what 
we are missing. 

The major changes in this new edition are the following. 

1 . In Chapter 2, a section has been added on logic with empty domains, that 
is, on what happens when we allow interpretations with an empty domain. ' 

2. In Chapter 4, Section 4.6 has been extended to include an outline of an 
axiomatic set theory with urelements. 

3. The subjects of register machines and random access machines have been 
dropped from Section 5.5 Chapter 5. 

4. An appendix on second-order logic will give the reader an idea of the 
advantages and limitations of the systems of first-order logic used in 
Chapters 2-4, and will provide an introduction to an area of much current 
interest. 

5. The exposition has been further streamlined, more exercises have been 
added, and the bibliography has been revised and brought up to date. 

The material of the book can be covered in two semesters, but, for a one- 
semester course. Chapters 1-3 are quite adequate (omitting, if hurried. 
Sections 1.5, 1.6 and 2.10-2.16). I have adopted the convention of prefixing 
a D to any section or exercise that will probably be difficult for a beginner, 
and an A to any section or exercise that presupposes familiarity with a topic 
that has not been carefully explained in the text. Bibliographic references are 
given to the best source of information, which is not always the earliest 
paper; hence these references give no indication as to priority. 

1 believe that the essential parts of the book can be read with ease by 
anyone with some experience in abstract mathematical thinking. There is, 
however, no specific prerequisite. 

This book owes an obvious debt to the standard works of Hilbert and 
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One of the popular definitions of logic is that it is the analysis of methods of 
reasoning. In studying these methods, logic is interested in the form rather 
than the content of the argument. For example, consider the two arguments: 

1. All men are mortal. Socrates is a man. Hence, Socrates is mortal. 

2. All cats like fish. Silvy is a cat. Hence, Silvy likes fish. 

Both have the same form: All A are B. S is an A. Hence, S is a B. The truth or 
falsity of the particular premisses and conclusions is of no concern to lo- 
gicians. They want to know only whether the premisses imply the conclu- 
sion. The systematic formalization and cataloguing of valid methods of 
reasoning are a main task of logicians. If the work uses mathematical 
techniques or if it is primarily devoted to the study of mathematical rea- 
soning, then it may be called mathematical logic. We can narrow the domain 
of mathematical logic if we define its principal aim to be a precise and 
adequate understanding of the notion of mathematical proof. 

Impeccable definitions have little value at the beginning of the study of a 
subject. The best way to find out what mathematical logic is about is to start 
doing it, and students are advised to begin reading the book even though (or 
especially if) they have qualms about the meaning and purpose of the 
subject. 

Although logic is basic to all other studies, its fundamental and appar- 
ently self-evident character discouraged any deep logical investigations until 
the late 19th century. Then, under the impetus of the discovery of non- 
Euclidean geometry and the desire to provide a rigorous foundation for 
calculus and higher analysis, interest in logic revived. This new interest, 
however, was still rather unenthusiastic until, around the turn of the cen- 
tury, the mathematical world was shocked by the discovery of the paradoxes 
- that is, arguments that lead to contradictions. The most important 
paradoxes are described here. 

1. Russell’s paradox (1902). By a set, we mean any collection of objects — for 

example, the set of all even integers or the set of all saxophone players in 

Brooklyn. The objects that make up a set are called its members or 
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elements. Sets may themselves be members of sets; for example, the set of 
all sets of integers has sets as its members. Most sets are not members of 
themselves; the set of cats, for example, is not a member of itself because 
the set of cats is not a cat. However, there may be sets that do belong to 
themselves - for example, the set of all sets. Now, consider the set .4 of all 
those sets X such thatW is not a member ofX. Clearly, by definition, A is 
a member of A if and only if A is not a member of A. So, if A is a member 
of A, then A is also not a member of A; and if A is not a member of A, then 
A is a member of A. In any case, A is a member of A and A is not a member 
of A. 

2. Cantor’s paradox (1899). This paradox involves the theory of cardinal 
numbers and may be skipped by those readers having no previous ac- 
quaintance with that theory .__The^ cardinal number F of a set Y is a 
measure of the size of the set; Y — Z if and only if Y is equinumerous with 
Z (that is, there is a one-one correspondence between Y and Z). We djefine 
Fs^Z to mean that Y is equinumerous with a subset of Z; by Y < Z we 
mean F^Z and F ^ Z. Ca ntor proved that, if ^(Y) is the set of all 
subsets of F, then F < 0>{Y). Let V be the universal set - th at is, t he set of 

all sets. Now, 0>(V) is a subset of V; so it follo ws eas ily that i?(V) ^ V. On 
the other hand, by Cantor’s Jtheorem^ V < 3P{V). Berns tein’s theorem 

asserts that, if F ^ Z and Z^F, then F = Z. Hence, V = &(V), contra- 
dicting V < 0>{y). 

3. Burali-Forti’s paradox (1897). This paradox is the analogue in the theory 
of ordinal numbers of Cantor’s paradox and requires familiarity with 
ordinal number theory. Given any ordinal number, there is a still larger 
ordinal number. But the ordinal number determined by the set of all 
ordinal numbers is the largest ordinal number. 

4. The liar paradox. A man says, ‘I am lying’. If he is lying, then what he 
says is true and so he is not lying. If he is not lying, then what he says is 
true, and so he is lying. In any case, he is lying and he is not lying.t 

5. Richard’s paradox (1905). Some phrases of the English language denote 
real numbers; for example, ‘the ratio between the circumference and 
diameter of a circle’ denotes the number n. All the phrases of the English 
language can be enumerated in a standard way: order all phrases that 
have k letters lexicographically (as in a dictionary) and then place all 
phrases with k letters before all phrases with a larger number of letters. 
Hence, all phrases of the English language that denote real numbers can 

^The Cretan ‘paradox’, known in antiquity, is similar to the liar paradox. The 
Cretan philosopher Epimenides said, ‘All Cretans are liars’. If what he said is true, 
then, since Epimenides is a Cretan, it must be false. Hence, what he said is false. 
Thus, there must be some Cretan who is not a liar. This is not logically impossible; so 
we do not have a genuine paradox. However, the fact that the utterance by Epi- 
menides of that false sentence could imply the existence of some Cretan who is not a 
liar is rather unsettling. 
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be enurifer^ted merely by omitting all other phrases in the given standard 
enumeration. Call the /ith real number in this enumeration the «th Ri- 
chard number. Consider the phrase: ‘the real number whose nth decimal 
place is 1 if the nth decimal place of the nth Richard number is not l, and 
whose / 2 th decimal place is 2 if the nth decimal place of the nth Richard 
number is 1.’ This phrase defines a Richard number - say, the k\h Ri- 
chard number; but, by its definition, it differs from the /cth Richard 
number in the kth decimal place. 

6. Berry’s paradox (1906). There are only a finite number of symbols (letters, 
punctuation signs, etc.) in the English language. Hence, there are only a 
finite number of English expressions that contain fewer than 200 occur- 
rences of symbols (allowing repetitions). There are, therefore, only a finite 
number of positive integers that are denoted by an English expression 
containing fewer than 200 occurrences of symbols. Let k be the least 
positive integer that is not denoted by an English expression containing 
fewer than 200 occurrences of symbols. The italicized English phrase 
contains fewer than 200 occurrences of symbols and denotes the integer k. 

7. Grelling’s paradox (1908). An adjective is called autological if the property 
denoted by the adjective holds for the adjective itself. An adjective is 
called heterological if the property denoted by the adjective does not 
apply to the adjective itself. For example, ‘polysyllabic’ and ‘English’ are 
autological, whereas ‘monosyllabic’ and ‘French’ are heterological. 
Consider the adjective ‘heterological’. If ‘heterological’ is heterological, 
then it is not heterological. If ‘heterological’ is not heterological, then it is 
heterological. In either case, ‘heterological’ is both heterological and not 
heterological. 

8. Lob’s paradox (1955). Let A be any sentence. Let B be the sentence: ‘If this 
sentence is true, then A’. So, B asserts: ‘If B is true, then .4’. Now consider 
the following argument: Assume B is true; then, by B, since B is true, A 
holds. This argument shows that, if B is true, then A. But this is exactly 
what B asserts. Hence, B is true. Therefore, by B, since B is true, A is true. 
Thus, every sentence is true. 

All of these paradoxes are genuine in the sense that they contain no 
obvious logical flaws. The logical paradoxes (1-3) involve only notions from 
the theory of sets, whereas the semantic paradoxes (4—8) also make use of 
concepts like ‘denote’, ‘true’ and ‘adjective’, which need not occur within 
our standard mathematical language. For this reason, the logical paradoxes 
are a much greater threat to a mathematician’s peace of mind than the 
semantic paradoxes. 

Analysis of the paradoxes has led to various proposals for avoiding them. 
All of these proposals are restrictive in one way or another of the ‘naive’ 
concepts that enter into the derivation of the paradoxes. Russell noted the 
self-reference present in all the paradoxes and suggested that every object 
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must have a definite non-negative integer as its ‘type’. Then an expression ( x 
is a member of the set y is to be considered meaningful if and only if the 
type of y is one greater than the type of x. 

This approach, known as the theory of types and systematized and de- 
veloped in Principia Mathematica Whitehead and Russell (1910-13), is 
successful in eliminating the known paradoxes,* but it is clumsy in practice 
and has certain other drawbacks as well. A different criticism of the logical 
paradoxes is aimed at their assumption that, for every property P(x) , there 
exists a corresponding set of all objects x that satisfy P(x) . If we reject this 
assumption, then the logical paradoxes are no longer derivable.* It is ne- 
cessary, however, to provide new postulates that will enable us to prove the 
existence of those sets that are needed by the practising mathematician. The 
first such axiomatic set theory was invented by Zermelo (1908). In Chapter 4 
we shall present an axiomatic theory of sets that is a descendant of Zer- 
melo’s system (with some new twists given to it by von Neumann, R. Ro- 
binson, Bernays, and Godel). There are also various hybrid theories 
combining some aspects of type theory and axiomatic set theory - for ex- 
ample, Quine’s system NF. 

A more radical interpretation of the paradoxes has been advocated by 
Brouwer and his intuitionist school (see Hey ting, 1956). They refuse to 
accept the universality of certain basic logical laws, such as the law of 
excluded middle: P or not-P. Such a law, they claim, is true for finite sets, 
but it is invalid to extend it on a wholesale basis to all sets. Likewise, they 
say it is invalid to conclude that ‘There exists an object jc such that not-P (a-)’ 
follows from the negation of ‘For all x, P{xf; we are justified in asserting the 
existence of an object having a certain property only if we know an effective 
method for constructing (or finding) such an object. The paradoxes are not 
derivable (or even meaningful) if we obey the intuitionist strictures, but so 
are many important theorems of everyday mathematics, and, for this rea- 
son, intuitionism has found few converts among mathematicians. 

Whatever approach one takes to the paradoxes, it is necessary first to 
examine the language of logic and mathematics to see what symbols may be 
used, to determine the ways in which these symbols are put together to form 
terms, formulas, sentences and proofs, and to find out what can and cannot 
be proved if certain axioms and rules of inference are assumed. This is one of 
the tasks of mathematical logic, and, until it is done, there is no basis for 

1 Russells’s paradox, for example, depends on the existence of the set A of all sets 
that are not members of themselves. Because, according to the theory of types, it is 
meaningless to say that a set belongs to itself, there is no such set A. 

* Russell’s paradox then proves that there is no set A of all sets that do not 
belong to themselves. The paradoxes of Cantor and Burali-Forti show that there is 
no universal set and no set that contains all ordinal numbers. The semantic para- 
doxes cannot even be formulated, since they involve notions not expressible within 
the system. 
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comparing rival foundations of logic and mathematics. The deep and de- 
vastating results of Godel, Tarski, Church, Rosser, Kleene, and many others 
have been ample reward for the labour invested and have earned for 
mathematical logic its status as an independent branch of mathematics. 

For the absolute novice a summary will be given here of some of the basic 
notation, ideas, and results used in the text. The reader is urged to skip these 
explanations now and, if necessary, to refer to them later on. 

A set is a collection of objects.! The objects in the collection are called 
elements or members of the set. We shall write ‘x e y’ for the statement that 
x is a member of y. (Synonymous expressions are ‘x belongs to y’ and ‘y 
contains x\) The negation of ‘x G y’ will be written ( x^y\ 

By ‘jc C y we mean that every member of x is also a member of y (sy- 
nonymously, that x is a subset of y, or that x is included in 3). We shall write 
7 = s’ to mean that t and s denote the same object. As usual, ‘ t y s’ is the 
negation of l t = s\ For sets x and y, we assume that x = y if and only if x C y 
and y C x - that is, if and only if x and y have the same members. A set x is 
called a proper subset of a set y, written ‘x C y 1 if x C y but x ^ y. (The 
notation x % y is often used instead of x C 3.) 

The union x U 3’ of sets x and y is defined to be the set of all objects that are 
members of x or y or both. Hence, x U x = x, x U y — y U x, and (rUj')Uz = 
xU (y Uz). The intersection xflj ; is the set of objects that x and y have in 
common. Therefore, xflx = x, x n y = y n x, and (xflj ; ) flz = xfl (yf\z). 
Moreover, x n (y U z) = (x n y) U (x n z) and x U (y n z) = (x U 3’) n (x U z) . 
The relative complement x — 3 is the set of members of x that are not 
members of y. We also postulate the existence of the empty set (or null set) 
0 - that is, a set that has no members at all. Then x fl 0 = 0 , x U 0 = x, 
x _ 0 = x, 0 — x = 0 , and x — x = 0 . Sets x and y are called disjoint if 
x n y — 0. 

Given any objects by , . . . , bk, the set that contains b \ , . . . , bk as its only 
members is denoted {b \ , . . . , bk}. In particular, {x, 3 ; } is a set having x and y 
as its only members and, if x y y, is called the unordered pair of x and y. The 
set {x,x} is identical with {x} and is called the unit set of x. Notice that 
(x-,3;} = {y,x}. By (£1 , , bk) we mean the ordered k- tuple of b \ , . . . , bk. The 
basic property of ordered ^-tuples is that {b\ , . . . , b k ) = {c\ , . . . , Ck) if and 
only if b\ = c\, #2 = £2, • • • > bk — Ck - Thus, (£1, £2) = (&2j &i) if and only if 
b\ = £>2- Ordered 2-tuples are called ordered pairs. The ordered 1 -tuple (b) is 
taken to be b itself. If X is a set and k is a positive integer, we denote by X k 
the set of all ordered ^-tuples {b\ , . . . , bk) of elements b\,...,bk of X. In 

f Which collections of objects form sets will not be specified here. Care will be 
exercised to avoid using any ideas or procedures that may lead to the paradoxes; all 
the results can be formalized in the axiomatic set theory of Chapter 4. The term 
‘class’ is sometimes used as a synonym for ‘set’, but it will be avoided here because it 
has a different meaning in Chapter 4. If a property P(x) does determine a set, that set 
is often denoted {x | P(x)}. 
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particular, X 1 is X itself. If Y and Z are sets, then by Y x Z we denote the set 
of all ordered pairs (y , z) such that y € Y and zgZ. Y x Z is called the 
Cartesian product of Y and Z. 

An n-place relation (or a relation with n arguments ) on a set X is a subset 
of X n - that is, a set of ordered / 2 -tuples of elements of X. For example, the 
3-place relation of betweenness for points on a line is the set of all 3-tuples 
(jc,y,z) such that the point x lies between the points y and z. A 2-place 
relation is called a binary relation; for example, the binary relation of fa- 
therhood on the set of human beings is the set of all ordered pairs (jc,y) such 
that x and y are human beings and jc is the father of y. A 1 -place relation on 
X is a subset of X and is called a property on X. 

Given a binary relation R on a set X, the domain of R is defined to be the 
set of all y such that ( y, z) e R for some z; the range of R is the set of all z 
such that ( y, z) e f? for some y; and the field of R is the union of the domain 
and range of R. The inverse relation R ~ 1 of R is the set of all ordered pairs 
{y,z} such that (z,y) E R. For example, the domain of the relation < on the 
set co of non-negative integers^ is co, its range is co — {0}, and the inverse of 
< is >. Notation: Very often x/ty is written instead of (jc,y) € R. Thus, in the 
example just given, we usually write x < y instead of (jc, y) E <. 

A binary relation R is said to be reflexive if xRx for all x in the field off?; R 
is symmetric if xRy implies yRx; and R is transitive if xRy and yRz imply xRz. 
Examples: The relation ^ on the set of integers is reflexive and transitive 
but not symmetric. The relation ‘having at least one parent in common’ on 
the set of human beings is reflexive and symmetric, but not transitive. 

A binary relation that is reflexive, symmetric and transitive is called an 
equivalence relation. Examples of equivalence relations are: (1) the identity 
relation lx on a set X, consisting of all pairs (jc,jc), where jc'e X; (2) given a 
fixed positive integer n, the relation jc = y (mod n), which holds when x and y 
are integers and jc — is divisible by n; (3) the congruence relation on the set 
of triangles in a plane; (4) the similarity relation on the set of triangles in a 
plane. Given an equivalence relation R whose field is X , and given any 
y E X y define [jJ as the set of all z in X such that yRz. Then [y] is called the 
R- equivalence class of y. Clearly, [w] = [t>] if and only if uRv. Moreover, if 
[w] / [y], then [u] D [u] = 0; that is, different ^-equivalence classes have no 
elements in common. Hence, the set X is completely partitioned into the 
^-equivalence classes. In example (1) above, the equivalence classes are just 
the unit sets {jc}, where jc E X. In example (2), there are n equivalence 
classes, the frth equivalence class (k = 0, 1, . . . ,n — 1) being the set of all 
integers that leave the remainder k upon division by /;. 

A function f is a binary relation such that (jc,y) E / and (jc,z) e / imply 
y = z. Thus, for any element jc of the domain of a function /, there is a 
unique y such that (jc, y) E /; this unique y is denoted /(jc). If jc is in the 

tco will also be referred to as the set of natural numbers. 
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domain of /> then / (x) is said to be defined. A function / with domain X and 
range Y is said to be a function fromX onto Y. If / is a function fromX onto 
a subset of Z, then / is said to be a function from X into Z. For example, if 
the domain of / is the set of integers and /(x) = 2x for every integer x, then 
f js a function from the set of integers onto the set of even integers, and / is 
a function from the set of integers into the set of integers. A function whose 
domain consists of /j-tuples is said to be & function of n arguments. A total 
function of n arguments on a set X is a function / whose domain is X n . It is 
customary to write f(x 1, . . . , x„ ) instead of /((xi, . . . ,x„)), and we refer to 
f(xi ,. . . , as the value of / for the arguments x i, . . . ,x„. A partial function 
of n arguments on a set X is a function whose domain is a subset of X For 
example, ordinary division is a partial, but not total, function of two ar- 
guments on the set of integers, since division by 0 is not defined. If / is a 
function with domain X and range Y, then the restriction fz of / to a set Z is 
the function / n (Z x 7). Then fz{u) = v if and only if u e Z and f(u) — v. 
The image of the set Z under the function / is the range of fz- The inverse 
image of a set W under the function / is the set of all u in the domain of / 
such that / (w) e W. We say that / maps X onto (into) Y if AT is a subset of 
the domain off and the image of X under / is (a subset of) Y. By an n-place 
operation (or operation with n arguments ) on a set X we mean a function 
from X 11 into X. For example, ordinary addition is a binary (i.e., 2-place) 
operation on the set of natural numbers {0, 1,2, •••}. But ordinary sub- 
traction is not a binary operation on the set of natural numbers. 

The composition fog (sometimes denoted fg) of functions / and g is the 
function such that (/ o g)(x) — f{g{x)); (/ o g)(x) is defined if and only if 
g{x) is defined and f(g(x)) is defined. For example, if g(x) = x 2 and 
f(x) — x + 1 for every integer jc, then (f o g) (jc) = jc 2 + 1 and (g o f)(x) = 
(jc + l) 2 - Also, if h(x) = —x for every real number x and / (jc) = \/x for every 
non-negative real number jc, then (f o h)(x) is defined only for x^O, and, for 
such x, (f o /?)(x) = \f—x. A function / such that/(x) = f(y) implies x = y is 
called a 1-1 ( one-one ) function. For example, the identity relation l x on a set 
X is a 1-1 function, since I x (y ) = y for every y E X; the function g with 
domain co, such that g(x) — 2x for every x € co , is 1—1; but the function h 
whose domain is the set of integers and such that h (x) = x 2 for every integer 
x is not 1-1, since h{— 1) = /?( 1 ) . Notice that a function / is 1-1 if and only if 
its inverse relation / _1 is a function. If the domain and range of a 1-1 
function / are X and Y, then/ is said to be a 1 — 1 {one-one) correspondence 
between X and Y; then / 1 is a 1-1 correspondence between Y and X, and 
(/ _1 of) — l x and (/ o f~ l ) — 1 Y . If / is a 1-1 correspondence between X 
and Y and g is a 1-1 correspondence between Y and Z, then g o f is a 1-1 
correspondence between X and Z. Sets X and Y are said to be equinumerous 
(written X = F) if and only if there is a l-l correspondence between X and 
Y. Clearly, X = X , X = Y implies Y = X, and X ^ Y and Y = Z implies 
X = Z. It is somewhat harder to show that, if X = Y\ C Y and Y = X\ c X, 




then X = Y (see Bernstein’s theorem in Chapter 4). If X = Y, one says that 
X and Y have the same cardinal number , and if X is equinumerous with a 
subset of Y but Y is not equinumerous with a subset of X, one says that the 
cardinal number of X is smaller than the cardinal number of K.' 1 

A set X is denumerable if it is equinumerous with the set of positive 
integers. A denumerable set is said to have cardinal number No, and any set 
equinumerous with the set of all subsets of a denumerable set is said to have 
the cardinal number 2^° (or to have the power of the continuum). A set X is 
finite if it is empty or if it is equinumerous with the set (1,2 ,...,«} of all 
positive integers that are less than or equal to some positive integer n. A set 
that is not finite is said to be infinite. A set is countable if it is either finite or 
denumerable. Clearly, any subset of a denumerable set is countable. A 
denumerable sequence is a function s whose domain is the set of positive 
integers; one usually writes s„ instead of s(/i). A finite sequence is a function 
whose domain is the empty set or {1,2, . . . ,«} for some positive integer n. 

Let / ) (x,yi , . . . ,}>*) be some relation on the set of non-negative integers. In 
particular, P may involve only the variable x and thus be a property. If 
P(0,yi, . . . ,y*) holds, and, if, for every n, P{n,y\, • • • ,J ; *) implies 
P(ti -I- l,Li, . • . then P(x 7 y \ , . . . ,y*) is true for all non-negative integers x 

( principle of mathematical induction). In applying this principle, one usually 
proves that, for every n , P(n,yi, . . . , yp ) implies P(n + 1 , y \ , . . . ,j>) by as- 
suming P(n,yi , . . . ,y*) and then deducing P(n + l,y\, . . . ,y*); in the course 
of this deduction, P{n,y\, . . . ,j^) is called the inductive hypothesis. If the 
relation P actually involves variables yi,.-.,yk other than x, then the proof is 
said to proceed by induction on x. A similar induction principle holds for the 
set of integers greater than some fixed integer j. An example is: to prove by 
mathematical induction that the sum of the first n odd integers 
1 + 3 + 5 + . . . + (2« — 1) is n 2 , first show that 1 = l 2 (that is, P(l)), and 
then, that if 1 + 3 + 5 + . . . + ( 2/2 — 1) = « 2 , then 1 + 3 -1- 5 + . . . + (2/2 — 1) 
+( 2/2 + 1) — (n + l) 2 (that is, if P(n) then P{n + 1)). From the principle of 
mathematical induction one can prove the principle of complete induction : If, 
for every non-negative integer x the assumption that P(u,y i, . . . ,y*) is true 
for all u < x implies that P(x,y\, . . . ,yf) holds, then, for all non-negative 
integers x , P(x, yi , . . . , y*) is true, (Exercise: Show by complete induction 
that every integer greater than 1 is divisible by a prime number.) 

A partial order is a binary relation R such that R is transitive and, for 
every x in the field of R, xRx is false. If R is a partial order, then the relation 
R' that is the union of R and the set of all ordered pairs (x,x), where x is in 
the field of R, we shall call a reflexive partial order, in the literature, ‘partial 
order’ is used for either partial order or reflexive partial order. Notice that 


*One can attempt to define the cardinal number of a set A as the collection [X] of 
all sets equinumerous with X. However, in certain axiomatic set theories, [X] does 
not exist, whereas in others [X] exists but is not a set. 
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(xRy and yRx) is impossible if R is a partial older, whereas (xRy and yRx) 
implies x = y if R is a reflexive partial order. A (reflexive) total order is a 
(reflexive) partial order such that, for any x and y in the field of R, either 
x = y or xRy or yRx. Examples: (1) the relation < on the set of integers is a 
total order, whereas ^ is a reflexive total order; (2) the relation c on the set 
of all subsets of the set of positive integers is a partial order but not a total 
order, whereas the relation C is a reflexive partial order but not a reflexive 
total order. If B is a subset of the field of a binary relation then an element 
y of B is called an R-least element of B if yRz for every element z of B different 
from y. A well-order (or a well-ordering relation ) is a total order R such that 
every non-empty subset of the field of R has an iMeast element. Examples: 
(1) the relation < on the set of non-negative integers is a well-order; (2) the 
relation < on the set of non-negative rational numbers is a total order but 
not a well-order; (3) the relation < on the set of integers is a total order but 
not a well-order. Associated with every well-order R having field X there is a 
complete induction principle: if P is a property such that, for any u in X, 
whenever all z in X such that zRu have the property P, then u has the 
property P, then it follows that all members of X have the property P. If the 
set X is infinite, a proof using this principle is called a proof by transfinite 
induction. One says that a set X can be well-ordered if there exists a well- 
order whose field is X . An assumption that is useful in modern mathematics 
but about the validity of which there has been considerable controversy is 
the well-ordering principle : every set can be well-ordered. The well-ordering 
principle is equivalent (given the usual axioms of set theory) to the axiom of 
choice: for any set X of non-empty pairwise disjoint sets, there is a set Y 
(called a choice set) that contains exactly one element in common with each 
set in X. 

Let B be a non-empty set, / a function from B into B, and g a function 
from B 1 2 3 * * * into B. Write x' for / (x) and jcfly for g(x, y). Then {B,f, g) is called 
a Boolean algebra if B contains at least two elements and the following 
conditions are satisfied: 

1. x fly = y n x for all x and y in B 

2. (jc fT y) n z = x D (y Pi z) for all jc, y, z in B 

3. jc fly 7 = z n z' if and only if x fly = x for all jt,y, z in B. 

Let xUy stand for (x‘ fly'/, and write jc^j; for jc = jc. It is easily proved 

that z n z ' = w n vJ for any w and z in B ; we denote the value of z n z ' by 0. 

Let 1 stand for O'. Then zllz' = 1 for all z in B. Note also that ^ is a 

reflexive partial order on B , and (5,/, U) is a Boolean algebra. (The symbols 
D, U, 0, 1 should not be confused with the corresponding symbols used in set 
theory and arithmetic.) An ideal J in (B, f,g) is a non-empty subset of B 
such that (1) if jc £ J and y £ /, then x U y £ /, and (2) if jc £ J and y £ B, 
then jc n y £ J. Clearly, {0} and B are ideals. An ideal different from B is called 
a proper ideal. A maximal ideal is a proper ideal that is included in no other 
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proper ideal. It can be shown that a proper ideal J is maximal if and only if, 
for any u in B, u e J or «' € J. From the axiom of choice it can be proved 
that every Boolean algebra contains a maximal ideal, or, equivalently, that 
every proper ideal is included in some maximal ideal. For example, let B be 
the set of all subsets of a setX; for Y € B, let Y f — X — Y, and for Y and Z in 
B, let rnZ be the ordinary set-theoretic intersection of F and Z. Then 
( B / , n) is a Boolean algebra. The 0 of B is the empty set 0, and 1 is X. For 
each element u in X , the set J u of all subsets of X that do not contain it is a 
maximal ideal. For a detailed study of Boolean algebras, see Sikorski (1960), 
Halmos (1963) and Mendelson (1970). 
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u PROPOSITIONAL CONNECTIVES. TRUTH TABLES 

Sentences may be combined in various ways to form more complicated 
sentences. We shall consider only truth-functional combinations, in which 
the truth or falsity of the new sentence is determined by the truth or falsity 
of its component sentences. 

Negation is one of the simplest operations on sentences. Although a sen- 
tence in a natural language may be negated in many ways, we shall adopt a 
uniform procedure: placing a sign for negation, the symbol -i, in front of the 
entire sentence. Thus, if A is a sentence, then -nA denotes the negation of A. 

The truth-functional character of negation is made apparent in the fol- 
lowing truth table : ^ , 

T F 
F T 

When A is true, ->A is false; when A is false, ->A is true. We use T and F to 
denote the truth values true and false. 

Another common truth-functional operation is the conjunction: ‘and’. The 
conjunction of sentences A and B will be designated by A A B and has the 
following truth table: 

ABA AB 
T T T 

FT F 

T F F 

F F F 

A A B is true when and only when both A and B are true. A and B are called 
the conjuncts of A AB. Note that there are four rows in the table, corre- 
sponding to the number of possible assignments of truth values to A and B. 

In natural languages, there are two distinct uses of ‘or’: the inclusive and 
the exclusive. According to the inclusive usage, i A or B ’ means ‘ A or B or 
both’, whereas according to the exclusive usage, the meaning is ‘ A or B, but 
not both’. We shall introduce a special sign, V, for the inclusive connective. 
Its truth table is as follows: 
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A B 
T T 
F T 
T F 
F F 


A VB 
T 
T 
T 
F 


Thus, A V B is false when and only when both A and B are false. *A V B ’ is 
called a disjunction , with the disjuncts A and B. 

Another important truth-functional operation is the conditional: ‘if A, 
then B\ Ordinary usage is unclear here. Surely, ‘if A, then B ’ is false when 
the antecedent A is true and the consequent B is false. However, in other 
cases, there is no well-defined truth value. For example, the following sen- 
tences would be considered neither true nor false: 

1 . If 1 + 1 = 2, then Paris is the capital of France. 

2. If 1 + 1 7 ^ 2, then Paris is the capital of France. 

3. If 1 + 1 7 ^ 2, then Rome is the capital of France. 

Their meaning is unclear, since we are accustomed to the assertion of some 
sort of relationship (usually causal) between the antecedent and the con- 
sequent. We shall make the convention that ‘if A, then Z?’ is false when and 
only when A is true and B is false. Thus, sentences 1—3 are assumed to be 
true. Let us denote ‘if A, then Z?’ by ‘ A =>- B\ An expression ‘A =>- B’ is called 
a conditional. Then => has the following truth table: 


A B A=> B 

T T T 

FT T 
T F F 

F F T 

This sharpening of the meaning of ‘if A, then 5’ involves no conflict with 
ordinary usage, but rather only an extension of that usage. i 

A justification of the truth table for => is the fact that we wish ‘if A and B, 
then B' to be true in all cases. Thus, the case in which A and B are true justifies 
the first fine of our truth table for =^, since (A and B) and B are both true. If A is 


f There is a common non-truth-functional interpretation of ‘if A, then B' connected 
with causal laws. The sentence ‘if this piece of iron is placed in water at time t, then the 
iron will dissolve’ is regarded as false even in the case that the piece of iron is not placed 
in water at time t - that is, even when the antecedent is false. Another non-truth- 
functional usage occurs in so-called counterfactual conditionals, such as ‘if Sir Walter 
Scott had not written any novels, then there would have been no War Between the 
States’. (This was Mark Twain’s contention in Life on the Mississippi : ‘Sir Walter had 
so large a hand in making Southern character, as it existed before the war, that he is in 
great measure responsible for the war’.) This sentence might be asserted to be false even 
though the antecedent is admittedly false. However, causal laws and counterfactual 
conditions seem not to be needed in mathematics and logic. For a clear treatment of 
conditionals and other connectives, see Quine (1951). (The quotation from Life on the 
Mississippi was brought to my attention by Professor J.C. Owings, Jr.) 
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false and B true, then (A and B ) is false while B is true. This corresponds to the 
second line of the truth table. Finally, iL4 is false and Sis false, (A and B) is false 
and B is false. This gives the fourth line of the table. Still more support for our 
definition comes from the meaning of statements such as ‘for every x, if x is an 
odd positive integer, then x 2 is an odd positive integer’. This asserts that, for 
every x, the statement ‘if x is an odd positive integer, then x 2 is an odd positive 
integer’ is true. Now we certainly do not want to consider cases in which x is not 
an odd positive integer as counterexamples to our general assertion. This 
supports the second and fourth lines of our truth table. In addition, any case in 
which x is an odd positive integer andx 2 is an odd positive integer confirms our 
general assertion. This corresponds to the first line of the table. 

Let us denote ‘ A if and only if B ’ by ‘A 4$ B’. Such an expression is called 
a biconditional. Clearly, A B is true when and only when A and B have the 
same truth value. Its truth table, therefore is: 

A B A&B 

T T T 

FT F 
T F F 

F F T 

The symbols -i, A,V,=* and will be called propositional connectives d 
Any sentence built up by application of these connectives has a truth value 
that depends on the truth values of the constituent sentences. In order to 
make this dependence apparent, let us apply the name statement form to an 
expression built up from the statement letters A,B,C, and so on by appro- 
priate applications of the propositional connectives. 

1. All statement letters (capital italic letters) and such letters with numerical 
subscripts^ are statement forms. 

2. If 3ft and are statement forms, then so are {->3ft), (33 A ^), 

(3ft V ( 3), (3ft => c 3), and {3$ < $). 

3. Only those expressions are statement forms that are determined to be so 
by means of conditions 1 and 2. § 

Some examples of statement forms are B , (-C 2 ), (Z>3 a H?)), 
(((-.Bi) V Bi) =*■ (A, A C 2 )), and (((-v4) o^)e(C=>(iV C))). 


t\Ve have been avoiding and shall in the future avoid the use of quotation marks 
to form names whenever this is not likely to cause confusion. The given sentence 
should have quotation marks around each of the connectives. See Quine (1951, pp. 
23-27). 

tFor example, Ai,A 2 ,An,Bji,C 2 , 

§ This can be rephrased as follows: ^ is a statement form if and only if there is a 
finite sequence 38 \, . . . , 38„(n > 1) such that 3ft tl = # and, if 1 3ft t is either a 

statement letter or a negation, conjunction, disjunction, conditional or biconditional 
constructed from previous expressions in the sequence. Notice that we use script 
letters s/, 3$, ( d , . . . to stand for arbitrary expressions, whereas italic letters are used 
as statement letters. 
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For every assignment of truth values T or F to the statement letters that 
occur in a statement form, there corresponds, by virtue of the truth tables 
for the propositional connectives, a truth value for the statement form. 
Thus, each statement form determines a truth function, which can be gra- 
phically represented by a truth table for the statement form. For example, 
the statement form (((-<T) V B) =$■ C) has the following truth table: 


A B 
T T 
F T 
T F 
F F 
T T 
F T 
T F 
F F 


M) (M)vi?) 

F T 

T T 

F F 

T T 

F T 

T T 

F F 

T T 


(((~^) V B)=>C) 

T 

T 

T 

T 

F 

F 

T 

F 


Each row represents an assignment of truth values to the statement letters 
A,B and C and the corresponding truth values assumed by the statement 
forms that appear in the construction of (((->^4) V B) =>- C). 

The truth table for ((A b)=KM)ab)) is as follows: 


A B (A o B) 

T T T 

FT F 
T F F 

F F T 



(M)A B) 
F 
T 
F 
F 


((^5)^(H)A5)) 

F 

T 

T 

F 


If there are n distinct letters in a statement form, then there are 2" possible 
assignments of truth values to the statement letters and, hence, 2” rows in 
the truth table. 

A truth table can be abbreviated by writing only the full statement form, 
putting the truth values of the statement letters underneath all occurrences 
of these letters, and writing, step by step, the truth values of each component 
statement form under the principal connective of the fornd. As an example, 
for ((A & B) ((-i4) A B)), we obtain: 




^The principal coimective of a statement form is the one that is applied last in 
constructing the form. 
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Exercises 

* 

\,\ Write the truth table for the exclusive usage of ‘or’. 

1 2 Construct truth tables for the statement forms {{A B)\J (~>A)) and 
((A C)) => ((A => B) => (A =>■ C))). 

U3 Write abbreviated truth tables for ((A ^>B) A A) and {{A V (~>C)) B). 

1.4 Write the following sentences as statement forms, using statement letters 
to stand for the atomic sentences - that is, those sentences that are not built 
up out of other sentences. 

(a) If Mr Jones is happy, Mrs Jones is not happy, and if Mr Jones is not 
happy, Mrs Jones is not happy. 

(b) Either Sam will come to the party and Max will not, or Sam will not 
come to the party and Max will enjoy himself. 

(c) A sufficient condition for x to be odd is that x is prime. 

(d) A necessary condition for a sequence s to converge is that s be bounded. 

(e) A necessary and sufficient condition for the sheikh to be happy is that he 
has wine, women and song. 

(f) Fiorello goes to the movies only if a comedy is playing. 

(g) The bribe will be paid if and only if the goods are delivered. 

(h) If x is positive, x 2 is positive. 

(i) Karpov will win the chess tournament unless Kasparov wins today. 

1.2 TAUTOLOGIES 

A truth function of n arguments is defined to be a function of n arguments, 
the arguments and values of which are the truth values T or F. As we have 
seen, any statement form containing n distinct statement letters determines a 
corresponding truth function of n arguments.! 


fTo be precise, enumerate all statement letters as follows: A,B,..., 
Z-,Ai,Bi, . . . ,Zi;A 2l . • . If a statement form contains the qth,. statement let- 
ters in this enumeration, where /j ^ then the corresponding truth function is 

to have x,-, , . . . ,x, n , in that order, as its arguments, where Xj. corresponds to the if 
statement letter. For example, (A B) generates the truth function 

Xl x 2 f{x u x 2 ) 

T T T 

FT T 
T F F 

F F T 


r 

whereas (B => A) generates the truth function 


A'l x 2 

T T 
F T 
T F 
F F 


g(x i,x 2 ) 

T 

F 

T 

T 
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A statement form that is always true, no matter what the truth values of 
its statement letters may be, is called a tautology. A statement form is a 
tautology if and only if its corresponding truth function takes only the value 
T, or equivalently, if, in its truth table, the column under the statement form 
contains only Ts. An example of a tautology is (A V (-u4)), the so-called law 
of the excluded middle. Other simple examples are (-'(A A hm, 
(A & (-.(-V4))), (04 A B) =* A) and (A =* (A V B)). 

PA is said to logically imply Pi (or, synonymously, ^ is a logical con- 
sequence of PS) if and only if every truth assignment to the statement letters 
of PS and Pi that makes PS true also makes P true. For example, (A A B) 
logically implies A, A logically implies (A V B), and (A A (A =>• B)) logically 
implies B. 

Pd and ^ are said to be logically equivalent if and only if PS and Pi receive 
the same truth value under every assignment of truth values to the statement 
letters of PS and Pi. For example, A and ( — >(— ) ) are logically equivalent, as 
are (A A B) and {B f\ A). 


PROPOSITION 1.1 

(a) PS logically implies Pi if and only if {PS =$■ Pi) is a tautology. 

(b) PS and Pi are logically equivalent if and only if {PS P>) is a tautology. 

Proof 

(a) (i) Assume PS logically implies Pi. Hence, every truth assignment that 
makes PS true also makes P> true. Thus, no truth asssignment makes PS 
true and P> false. Therefore, no truth assignment makes {PS Pi) false, 
that is, every truth assignment makes {PS =>• P J ) true. In other words, 
{PS => P’) is a tautology, (ii) Assume {PS Pi) is a tautology. Then, for 
every truth assignment, {PS — > Pi) is true, and, therefore, it is not the case 
that PS is true and P> false. Hence, every truth assignment that makes PS 
true makes Pi true, that is, PS logically implies Pi. 

(b) {PS 44* is a tautology if and only if every truth assignment makes 
{PS Pi) true, which is equivalent to saying that every truth assignment 
gives PS and P the same truth value, that is, PS and Pi are logically 
equivalent. 

By means of a truth table, we have an effective procedure for determining 
whether a statement form is a tautology. Hence, by Proposition 1 . 1 , we have 
effective procedures for determining whether a given statement form logi- 
cally implies another given statement form and whether two given statement 
forms are logically equivalent. 

To see whether a statement form is a tautology, there is another method 
that is often shorter than the construction of a truth table. 
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Examples 

1 Determine whether {{A 44 ((->#) VC))^ ((->yl) =4 B)) is a tautology. 


Assume that the statement form 
sometimes is F (line 1). Then (A 44 
((-nB) V C)) is T and ((-*4) =4- B) is 
F (line 2). ^nce ((-i4) =4 B) is F, 
f-,A) is T and B is F (line 3). Since 
(->A) is T, A is F (line 4). Since A is 
F and (A 44 ((^B) VC)) is T, 
((~iB) VC) is F (line 5). Since 
((- ,b ) V C) is F, (-i B) and C are F 
(line 6). Since (->B) is F, B is T (line 
7). But B is both T and F (lines 7 
and 3). Hence, it is impossible for 
the form to be false. 


({Ao(nm v Q)=>(nA)^B)) 

F 

T F 

T F 

F F 

F 

F F 
T 


1 

2 

3 

4 

5 

6 
7 


2. Determine whether ( (A =4- (B V C) ) V {A 
Assume that the form is F (line 1). 
Then (A =4- (/? V C )) and (A => B) are F 
(line 2). Since (A =4 B) is F, A is T and B is 
F (line 3). Since (A =4 (B V C)) is F, A is T 
and ( B V C) is F (line 4). Since (B V C) is 
F, B and C are F (line 5). Thus, when A is 
T, B is F, and C is F, the form is F. 
Therefore, it is not a tautology. 


B )) is a tautology. 


((^(BvQ)v(^B)) 

F 

F F 

T F 

T F 
F F 


1 

2 

3 

4 

5 


Exercises 

1.5 Determine whether the following are tautologies. 

(a) (((v» => B) =e B) => B) (f) (A =b (B => (B => .4))) 

(b) (((A =b B) =>• B) =s- A) (g) ((A A 5) => V C)) 

(c) (((A =$■ B) =$■ A) => A) (h) ((A aa B) (A (B & A))) 

(d) (((B => C) =$■ (A =5> B)) ^(A^ B)) (i) ((A =s- B) V (B =s- A)) 

(e) ((dvH*Aq))^((d»C)vS)) 0) (("'(-'< =b -B)) => A) 

1.6 Determine whether the following pairs are logically equivalent. 

(a) {{A =4 B) =4 A) and A 

(b) (A 44 B) and ((A =4- B) A (5 =4 ^4)) 

(c) ((-vl) V B) and ((- E ) V^4) 

(d) (~<(A 44 5)) and (.4 44 (-<#)) 

(e) (A V (5 44 C)) and ((A VB)&(AV C)) 

(f) (,4 =4 (£ 44 C)) and ((A =4 #) 44 (yl =4 C)) 

(g) (A A (B <=> C)) and ((yl A^) ^ (4 A C)) 
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1.7 Prove: 

(a) (A => B) is logically equivalent to ((->/}) \f B). 

(b) (A =y> B) is logically equivalent to (->(A A (->#))). 

1.8 Prove that 38 is logically equivalent to ^ if and only if (A logically 
implies <3 and ^ logically implies 38. 

1.9 Show that 38 and are logically equivalent if and only if, in their truth 
tables, the columns under 38 and are the same. 

1.10 Prove that 38 and are logically equivalent if and only if (->38) and 
(-83) are logically equivalent. 

1.11 Which of the following statement forms are logically implied by 
(A 3B)1 

(a) A (d) (hA) V B) (g) (A => B) 

(b) B (e) ((-iB) =* A) (h) ((-iB) =* (^A)) 

(c) (A V B) (f) (A « B) (i) (A A (-B)) 

1.12 Repeat Exercise 1.11 with (A A B) replaced by (A => B) and by 
(-i (A => B)), respectively. 

1.13 Repeat Exercise 1.11 with (A A B) replaced by (AVB). 

1.14 Repeat Exercise 1.11 with (AaB) replaced by (A 45 B) and by 

(-i (A B)), respectively. 

A statement form that is false for all possible truth values of its statement 
letters is said to be contradictory. Its truth table has only Fs in the column 
under the statement form. One example is (A hA)): 

A 0A) (A ^ (-d)) 

T F F 
FT F 


Another is (A A HO)- 

Notice that a statement form 38 is a tautology if and only if (->0) is 
contradictory, and vice versa. 

A sentence (in some natural language like English or in a formal theory) t 
that arises from a tautology by the substitution of sentences for all the 
statement letters, with occurrences of the same statement letter being re- 
placed by the same sentence, is said to be logically true (according to the 
propositional calculus). Such a sentence may be said to be true by virtue of 
its truth-functional structure alone. An example is the English sentence, ‘If it 
is raining or it is snowing, and it is not snowing, then it is raining’, which 
arises by substitution from the tautology (((/4 \f B) A hB)) => A). A sen- 
tence that comes from a contradictory statement form by means of sub- 
stitution is said to be logically false (according to the propositional calculus). 

Now let us prove a few general facts about tautologies. 

^By a formal theory we mean an artificial language in which the notions of 
meaningful expressions, axioms and rules of inference are precisely described (see page 
34). 




PROPOSITION 1.2 


If 88 and [88 => are tautologies, then so is <8. 

-a 


Proof 

Assume that 88 and [88 =>• ( 8) are tautologies. If ( € took the value F for some 
assignment of truth values to the statement letters of 88 and ( 8, then, since 88 
is a tautology, 88 would take the value T and, therefore, [88 ■=> *8) would 
have the value F for that assignment. This contradicts the assumption that 
[c% =*. <8) is a tautology. Hence, 8 never takes the value F. 

PROPOSITION 1.3 

If is a tautology containing as statement letters A \ , A 2 , . . . ,A„, and 88 
arises from 8T by substituting statement forms . . . , 8f n for 

Ai, A 2, • • • ,A„, respectively, then 88 is a tautology; that is, substitution in a 
tautology yields a tautology. 

Example 

Let ST be {{A\ AA 2 ) => A \ ), let SA\ be [B V C) and let 88 2 be (C A D). Then 88 
is (((. B V C) A (C A D)) =► [B V C)) . 


Proof 

Assume that is a tautology. For any assignment of truth values to the 
statement letters in 88, the forms 88 1 , . . . , 88 n have truth values jq , . . . , x„ 
(where each x„ is T or F). If we assign the values jq, . . . ,x„ to A\, . . . ,A„, 
respectively, then the resulting truth value of 88 is the truth value of 88 for 
the given assignment of truth values. Since is a tautology, this truth value 
must be T. Thus, 88 always takes the value T. 


PROPOSITION 1.4 

If 8\ arises from 88 \ by substitution of 8 for one or more occurrences of 88, 
then [[88 => [88\ & ^ 1 )) is a tautology. Hence, if 88 and *8 are logi- 

cally equivalent, then so are 88 \ and *8\. 

Example 

Let 88\ be (CVD), let 88 be C, and let *8 be (-i(-iC)). Then V\ is 
((->(-iC)) V D). Since C and (->(-' C)) are logically equivalent, (CvD) and 
((-<(-iC)) V D) are also logically equivalent. 



Proof 




Consider any assignment of truth values to the statement letters. If 38 and # 
have opposite truth values under this assignment, then (38 44 <3) takes the 
value F, and, hence, ({38 44 *3) =>■ (38 \ 44 c 3y)) is T. If 38 and take the 
same truth values, then so do 38 \ and ( 3\ , since ^ differs from 38 \ only in 
containing *3 in some places where 38 \ contains 38. Therefore, in this case, 
(38 44 <3) is T, (38 x 44 ^i) is T, and, thus, ((38 44 < 3)=4 (38\ 44 ^i)) is T. 


Parentheses 

It is profitable at this point to agree on some conventions to avoid the use of 
so many parentheses in writing formulas. This will make the reading of 
complicated expressions easier. 

First, we may omit the outer pair of parentheses of a statement form. (In 
the case of statement letters, there is no outer pair of parentheses.) 

Second, we arbitrarily establish the following decreasing order of strength 
of the connectives: A, V, Now we shall explain a step-by-step 

process for restoring parentheses to an expression obtained by eliminating 
some or all parentheses from a statement form. Find the leftmost occurrence 
of the strongest connective that has not yet been processed. 

(i) If the connective is -i and it precedes a statement form 38, restore left 
and right parentheses to obtain (-*38) . 

(ii) If the connective is a binary connective C and it is preceded by a 
statement form 38 and followed by a statement form S>, restore left and 
right parentheses to obtain (38 C 

(iii) If neither (i) nor (ii) holds, ignore the connective temporarily and find 
the leftmost occurrence of the strongest of the remaining unprocessed 
connectives and repeat (i)— (iii) for that connective. 

Examples 

Parentheses are restored to the expression in the first line of each of the 
following in the steps shown: 

1 . A 44 (~<B) V C =>- A 
A 44 ((^B) V C) =4 A 
A 44 (((^B)V C) =4 A) 

(A & (((^B) V Q => A)) 

2. A=>-^B=>C 
A =* (-*B) =>- C 
(A => (-nB)) => C 
(( A =>■ (-B)) =!• C) 

3. B=s>-.-u4 

B ^ A ) 






R=$ 

( B => ( _, (" v ^))) 

| AV<B^AVB) 

A V - , (B t =^ (A V R)) 

AVHB=> (AVB))) 

(A V {->{B => (A V B)))) 

Not every form can be represented without the use of parentheses. For 
example, parentheses cannot be further eliminated from A => (B => C), since 
I g =$. C stands for ((A => B) => C). Likewise, the remaining parentheses 
cannot be removed from -> (A VB ) or from A A (B => C). 

Exercises 

1JI5 Eliminate as many parentheses as possible from the following forms. 

'(a) ((B => (-*<)) A C) (e) ((A « B) « (-.(C V D ))) 

"(b) (/l V (B V C)) (f) V C)))) » (B » C)) 

(c) (((^ A (-J)) A C) V D) (g) HH-(B VC)))«(S# C))) 

(d) ((B V (— 'C)) V A B)) (h) ((((A => B) (C => D)) A (-v<)) V C) 

L16 Restore parentheses to the following forms. 

(a) CV ~>A A B (c ) C => -<(/l A R => C) Av4 <*=> R 

(b) R =^> — i — i - 1/4 A C (d) C = r > ^4 ^4 v4 V R 

1.17 Determine whether the following expressions are abbreviations of 
statement forms and, if so, restore all parentheses. 

(a) -rA&A&BVC (d) A & (->A V B) => (AA(BvC))) 

(b) i (— \A O A) O R V C (e) -A \ZBvCAD<&AA —>A 

(c) -i(^4 =>R)VCVZ)=^R (f) ((A => B A (C V D) A (A V D)) 

1.18 If we write ->Aft instead of (->Aft), => Aft ( € instead of (Aft => Yi) , AAftAi 
ihstead of (Aft A ( A), NAft'A instead of (AftM ( €), and AtA€ instead of (Aft ( A), 
then there is no need for parentheses. For example, ((- A ) A (R => (->/)))), 
which is ordinarily abbreviated as ~>A A (R => ->Z)), becomes A -k 4 => R-<Z). 
This way of writing forms is called Polish notation . 

(a) Write ((C => (-u4)) V R) and (C V ((R A (->£>)) => C)) in this notation. 

(b) If we count =>, a, V, and -$=> each as +1, each statement letter as —1 and 
-i as 0, prove that an expression Aft in this parenthesis-free notation is a 
statement form if and only if (i) the sum of the symbols of Aft is — 1 and 
(ii) the sum of the symbols in any proper initial segment of Aft is non- 
negative. (If an expression Aft can be written in the form ( S C Y, where 

Aft, then ^ is called a proper initial segment of Aft.) 

(c) Write the statement forms of Exercise 1.15 in Polish notation. 
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(d) Determine whether the following expressions are statement forms in 
Polish notation. If so, write the statement forms in the standard way. 

(i) =4* ABC V AB^C (iii) V A V-*A -nBC A VAC V ^C^A 

(ii) =4=4 AB =4=4 BC =4 -i AC (iv) V ABA BBB 

1.19 Determine whether each of the following is a tautology, is contra- 
dictory, or neither. 

(a) B&(BVB) if) A A A V B)) 

(b) {{A => B) AB) =4/1 (g) (A =4 B) 44 ((-A) V B) 

(c) (—i/l) =4 (A A B) (h) (A =4 B) 44 —i i(A A (— ' B)) 

(d ) (A => B) => ((B => C) => (A => C)) (i) [B^{B^A))^A 

(e) (A 44 ->B) =>AvB (j) AA^A^B 

1.20 If A and B are true and C is false, what are the truth values of the 
following statement forms? 

(a) A V C (e) BV ->C => A 

(b) A A C (f) {B VA) =4 (£ =4 -.C) 

(c) -A A ^C (g) (B =4 ->A) 44 (A 44 C) 

(d ) A <*^B VC (h) (£ =4 4l) =4 {{A =■> -nC) =4 (-C => B)) 

1.21 If A =4 B is T, what can be deduced about the truth values of the 
following? 

(a) AV C =4 BV C 

(b) A A C =4 B A C 

(c) —i A A B 44 A V B 

1.22 What further truth values can be deduced from those shown? 


(a) -A V (A =4 B ) (c) M V B) =>(A =4 -.C) 

F F 

(b) -,(A AS) 44 -u4 =4 -B (d)(/4 44 B) 4 4(C =4 -'A) 

T FT 

1.23 If A 44 B is F, what can be deduced about the truth values of the 

following? 


(a)AAB (b) A V B (c )A=>B (d) A AC 44 £ AC 

1.24 Repeat Exercise 1.23, but assume that A 44 B is T. 

1.25 What further truth values can be deduced from those given? 

(a) (A A B) 44 VB) 

F F 

(b) (A =4 -r B ) MC =* B) 

F 

1.26 (a) Apply Proposition 1.3 when is A\ =4 A\ V Ai, SA \ is B AD, and 
2 is -B. 

(b) Apply Proposition 1.4 when is (B =4 C) A D, 0S is B =4 C, and ^ is 
—*B V C. o 



TAUTOLOGIES 


-*r 



- <gj . sf 10 w that each statement form in column I is logically equivalent to 
the form next to it in column II. 


i XiV 

(a) A^(B^C) 

( b) AA(BVC) 

(c) A V (B A C) 
(cl) (AAB)V^B 
(e) (A V B) A ->£ 

|T) A^B 

(g) A & B 

(h) {A & B) <£> C 

(i) A &B 

(j) ~'(A ^ B) 

(k) 

(l) ->(A A B) 

(m) A V {A A #) 

(n) ^A(^V5) 

(o) A AB 

(p) AVB 

(q) (A A B) A C 

(r) (AVB)VC 


II 

(AAB) ^ C 
(A A B) V (A AC) 
(AVB) A (A VC) 
A V ~<B 
A A ~>B 

~~<B =>- —iA 

B & A 

A &(B&C) 

(A AB) V (-iA A - 
A 4A ^B 
(— i A) A (~<B) 

(~ i A) V (~<B) 

A 

A 

baa 

BV A 

A A (B AC) 

A V (BV C) 


(Distributive law) 
(Distributive law) 


(Law of the contrapositive) 
(Biconditional commutativity) 
(Biconditional associativity) 

B) 

(De Morgan’s law) 

(De Morgan’s law) 


(Commutativity of conjunction) 
(Commutativity of disjunction) 
(Associativity of conjunction) 
(Associativity of disjunction) 


L28 Show the logical equivalence of the following pairs. 


(a) S' a $1 and SS, where S' is a tautology. 

(hi F V 08 and 0 , where 0 is a tautology. 

(c) IF and ^ where IF is contradictory. 

(d) F V 3 and 03, where &■ is contradictory. 

1,29 

(a) Show the logical equivalence of -i (A B) and A A -<B. 

(b) Show the logical equivalence of ~*(A 44* B) and (A A ~>B) V (->A A B). 

(c) For each of the following statement forms, find a statement form that is 
logically equivalent to its negation and in which negation signs apply 
only to statement letters. 

(i) A => (B 4$ -iC) 

(ii) -vf V (B => C) 

(Hi) A A(BV ->C) 


1.30 (Duality) 

■*\ (a) If 0$ is a statement form involving only -i, A, and V, and 0$ results from 
03 by replacing each A by V and each V by A, show that 01 is a tautology 
if and only if ~>08' is a tautology. Then prove that, if 0$ <0 is a tau- 
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tology, then so is =>- 3?, and if 38 <*=> <€ is a tautology, then so is 
38' ^ . (Here ( S is also assumed to involve only -i, A and V.) 

(b) Among the logical equivalences in Exercise 1 .27, derive (c) from (b), (e) 
from (d), (1) from (k), (p) from (o), and (r) from (q). 

(c) If 38 is a statement form involving only A and V, and 38* results from 
38 by interchanging A and V and replacing every statement letter by its 
negation, show that 38* is logically equivalent to -■ 38. Find a statement 
form that is logically equivalent to the negation of (A v B v C) 
A(-i A V -i B\l D), in which -> applies only to statement letters. 

1.31 


(a) Prove that a statement form that contains as its only connective is a 
tautology if and only if each statement letter occurs an even number of 
times. 

(b) Prove that a statement form that contains -> and as its only con- 
nectives is a tautology if and only if -i and each statement letter occur an 
even number of times. 


1.32 (Shannon, 1938) An electric circuit containing only on off switches 
(when a switch is on, it passes current; otherwise it does not) can be re- 
presented by a diagram in which, next to each switch, we put a letter re- 
presenting a necessary and sufficient condition for the switch to be on (see 
Figure 1.1). The condition that a current flows through this network can be 
given by the statement form (A AB) v (C A ->A). A statement form re- 
presentating the circuit shown in Figure 1.2 is (A A B) V ((C V A) A 
which is logically equivalent to each of the following forms by virtue of the 


indicated logical equivalence of Exercise 1.27. 

((A AB) V (CV A)) A ((A AB) V -> 2 ?) (c) 

((A AB)V (Cv A)) A(Av ->B) (d) 

((AAB)V(AV C)) A (A V ~B) (p) 

(((A AB) V A) v C) A (A v ->B) (r) 

(Av C) A (A V ~>B) (p),(m) 

A V (C A B) (c) 


Hence, the given circuit is equivalent to the simpler circuit shown in Fig- 
ure 1.3. (Two circuits are said to be equivalent if current flows through one if 
and only if it flows through the other, and one circuit is simpler if it contains 
fewer switches.) 


A 

A 


- b \ 

-i A 


Figure. 1.1 





(a) Find simpler equivalent circuits for those shown in Figures 1.4, 1.5 and 

1 . 6 . 

(b) Assume that each of the three members of a committee votes yes on a 
proposal by pressing a button. Devise as simple a circuit as you can that 
will allow current to pass when and only when at least two of the 
members vote in the affirmative. 
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(c) We wish a light to be controlled by two different wall switches in a room 
in such a way that flicking either one of these switches will turn the light 
on if it is off and turn it off if it is on. Construct a simple circuit to do the 
required job. 

1.33 Determine whether the following arguments are logically correct by 
representing each sentence as a statement form and checking whether the 
conclusion is logically implied by the conjunction of the assumptions. (To 
do this, assign T to each assumption and F to the conclusion, and determine 
whether a contradiction results.) 

(a) If Jones is a communist, Jones is an atheist. Jones is an atheist. 
Therefore, Jones is a communist. 

(b) If the temperature and air pressure remained constant, there was no 
rain. The temperature did remain constant. Therefore, if there was rain, 
then the air pressure did not remain constant. 

(c) If Gorton wins the election, then taxes will increase if the deficit will 
remain high. If Gorton wins the election, the deficit will remain high. 
Therefore, if Gorton wins the election, taxes will increase. 

(d) If the number x ends in 0, it is divisible by 5. x does not end in 0. Hence, 
x is not divisible by 5. 

(e) If the number x ends in 0, it is divisible by 5. x is not divisible by 5. 
Hence, x does not end in 0. 

(f) If a — 0 or b = 0, then ab = 0. But ab ^ 0. Hence, a ^ 0 and b ^ 0. 

(g) A sufficient condition for / to be integrable is that g be bounded. A 
necessary condition for h to be continuous is that / is integrable. Hence, 
if g is bounded or h is continuous, then / is integrable. 

(h) Smith cannot both be a running star and smoke cigarettes. Smith is not 
a running star. Therefore, Smith smokes cigarettes. 

(i) If Jones drove the car, Smith is innocent. If Brown fired the gun, then 
Smith is not innocent. Hence, if Brown fired the gun, then Jones did not 
drive the car. 


1.34 Which of the following sets of statement forms are satisfiable, in the 
sense that there is an assignment of truth values to the statement letters that 
makes all the forms in the set true? 


(a) A => B 

B => C 

CvD&->B 

(b) -n(-nBVA) 

A V ->C) 

B =>■ —i C 

(c) D => B • y 

A V ~>B * 

-.(£) A A) 

?' D 
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1 35 Check each of the following sets of statements for consistency by re- 
presenting the sentences as statement forms and then testing their con- 
junction to see whether it is contradictory. 

(a) Either the witness was intimidated or, if Doherty committed suicide, a 
note was found. If the witness was intimidated, then Doherty did not 
commit suicide. If a note was found, then Doherty committed suicide. 

(b) The contract is satisfied if and only if the building is completed by 30 
November. The building is completed by 30 November if and only if the 
electrical subcontractor completes his work by 10 November. The bank 
loses money if and only if the contract is not satisfied. Yet the electrical 
subcontractor completes his work by 10 November if and only if the 
bank loses money. 


1.3 ADEQUATE SETS OF CONNECTIVES 

Every statement form containing n statement letters generates a corre- 
sponding truth function of n arguments. The arguments and values of the 
function are T or F. Logically equivalent forms generate the same truth 
function. A natural question is whether all truth functions are so generated. 


PROPOSITION 1.5 

Every truth function is generated by a statement form involving the con- 
nectives -i, A and V. 

Proof 

r 

(Refer to Examples 1 and 2 below for clarification.) Let /( jq, . . . , x„) be a 
truth function. Clearly f can be represented by a truth table of 2" rows, 
where each row represents some assignment of truth values to the variables 
xj , . . . , x n , followed by the corresponding value of f(x i, ...,x„). If 
1 let Q be the conjunction U\ A U l 2 A ... A U l n , where U'- is Aj if, in 

the /'th row of the truth table, xj takes the value T, and U‘- is - Aj if xj takes 
the value F in that row. Let D be the disjunction of all those C,s such that / 
has the value T for the fth row of the truth table. (If there are no such rows, 
then / always takes the value F, and we let D be A \ f\->A\, which satisfies the 
theorem.) Notice that D involves only A and V. To see that D has / as its 
corresponding truth function, let there be given an assignment of truth 
values to the statement letters A iy . . . , A„, and assume that the corrre- 
ponding assignment to the variables xi , . . . , x„ is row k of the truth table for 
/ . Then Q has the value T for this assignment, whereas every other C, has 
the value F. If f has the value T for row k , then Cjt is a disjunct of D. Hence, 
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D would also have the value T for this assignment. If / has the value F for 
row k , then C* is not a disjunct of D and all the disjuncts take the value F for 
this assignment. Therefore, D would also have the value F. Thus, D gen- 
erates the truth function /. 

Examples 

1 . 

*1 *2 f(x 1,X 2 ) 

T T F 

FT T 
T F T 

F F T 

D is (— i \A\ A Aj) V (A i A ~ , A2) V ( — *-^ 4 i A — 'A2). 


A'2 -V3 

T T 

T T 

F T 

F T 

T F 

T F 

F F 

F F 


T 

F 

T 

T 

F 

F 

F 

T 


D is (A 1 A A2 A Aj ) V (A 1 A —1A2 A AjJ V (~ >A\ A ~ '^2 A ^3) 

V ( — 1 ^ 4 1 A — 1^2 A ~ 1/^3). 

Exercise 

1.36 Find statement forms in the connectives A and v that have the 
following truth functions. 

*1 *2 * 3 /( A'i,*2,X 3 ) Q(x 1,^2, X 3 ) / 7 (xi,X 2 ,X 3 ) 

T T T T T F 

F T T T T T 

T F T T T F 

F F T F F F 

T T F F T T 

F T F F F T 

T F F F T F 

F F F T F T 


COROLLARY 1.6 


Every truth function can be generated by a statement form containing as 
connectives only A and or only v and -1, or only => and 
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proof 

\ 

Ndtioe that V ^ is logically equivalent to A -fiS). Hence, by the 

ond part of Proposition 1.4, any statement form in A, V and -« is logically 
equivalent fb a statement form in only A and [obtained by replacing all 
expressions ^ V ^ by ->(->$ A -><&)\. The other parts of the corollary are 
sjniilar consequences of the following tautologies: 

3ft A A3 - 1(— >3ft V - 1*^) 

# V ^ (-.# ) 

3ft a<£a3 -,(& => -*?) 

We have just seen that there are certain pairs of connectives - for ex- 
'trhple, A and -> — in terms of which all truth functions are definable. It turns 
oiit that there is a single connective, [ (joint denial), that will do the same 
job. Its truth table is: 

J A B A IB 

T T F 

FT F 

T F F 

F F T 


A \g is true when and only when neither A nor B is true. Clearly, 
i A A 3 (A l A) and (A AB) A3 (( A j A) 1 (B [ B)) are tautologies. Hence, the 
adequacy of | for the construction of all truth functions follows from 
Corollary 1.6. 

Another connective, | (alternative denial), is also adequate for this pur- 
pose. Its truth table is 

A B A\B 
F 
T 
T 
T 

/F| B is true when and only when not both A and B are true. The adequacy of 
| follows from the tautologies ->A A3 (A | A) and (A VB) A3 ((A l A) | 
(B\B)). 


T T 
F T 
T F 
F F 


A 

"PROPOSITION 1.7 


The only binary connectives that alone are adequate for the construction of 
all truth functions are l and I. 


Assume that h(A , B ) is an adequate connective. Now, if /?(T,T) were T, then 
any statement form built up using h alone would take the value T when all 
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its statement letters take the value T. Hence, -> A would not be definable in 
terms of h. So, /?(T,T) = F. Likewise, /7(F,F) = T. Thus, we have the partial 
truth table 

A B h (A ,Z?) 

T T F 

F T 

T F 

F F T 

If the second and third entries in the last column are F, F or T, T, then h is | 
or |. If they are F, T, then h(A,B) ->B is a tautology; and if they are T, F, 
then h(A,B ) <$=> -'A is a tautology. In both cases, h would be definable in 
terms of -i. But -i is not adequate by itself because the only truth functions 
of one variable definable from it are the identity function and negation itself, 
whereas the truth function that is always T would not be definable. 


Exercises 

1.37 Prove that each of the pairs =>, V and -i, -<=> is not alone adequate to 
express all truth functions. 

1.38 

(a) Prove that A V B can be expressed in terms of => alone. 

(b) Prove that A a B cannot be expressed in terms of =>• alone. 

(c) Prove that A<=> B cannot be expressed in terms of =>- alone. 

1.39 Show that any two of the connectives {A, =>, <s=>} serve to define the 
remaining one. 

1.40 With one variable A, there are four truth functions: 

A -i A A V ~>A A A -A 
TFT F 

F T T F 

(a) With two variable A and B, how many truth functions are there ? 

(b) How many truth functions of n variables are there ? 

1.41 Show that the truth function h determined by (A V B) =>• -»C generates 
all truth functions. 

1.42 By a literal we mean a statement letter or a negation of a statement 
letter. A statement form is said to be in disjunctive normal form (dnf) if it is 
a disjunction consisting of one or more disjuncts, each of which is a 
conjunction of one or more literals - for example, (A A B) v (~iA A C) , 
(A A B A - A ) V (C A ~iB) V (A A ->C), A, A A B, and Av (Bv C). A form is 
in conjunctive normal form (cnf) if it is a conjunction of one or more conjuncts, 
each of which is a disjunction of one or more literals - for example, 
(. B V C)A(A V B), (. B V — iC) A (AvD), A A (B V A) A (S VA), A V -<B, AA 
B, A. Note that our terminology considers a literal to be a (degenerate) con- 
junction and a (degenerate) disjunction. 
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is 


(< ) The proof of Proposition 1.5 shows that every statement form 

logically equivalent to one in disjunctive normal form. By applying this 
result to prove that PA is also logically equivalent to a form in 


conjunctive normal form. 

,fo) pj n d logically equivalent dnfs and cnfs for ~>(A => B) V (-'A A C) and 
j ^ Mg a -i A) V C). [Hint: Instead of relying on Proposition 1.5, it is 
usually easier to use Exercise 1.27(b) and (c).] 

(cj A dnf (cnf) is called full if no disjunct (conjunct) contains two occur- 
rences of literals with the same letter and if a letter that occurs in one 
disjunct (conjunct) also occurs in all the others. For example, 
(A A ~A A B) V (A A B), (B A B A C) V (B A C) and (B A C) V B are not 
full, whereas (A A B A - 'C) V (A A B A C) V (A A — i B A — iC) and (A A ~>B) 
V(B A A) are full dnfs. 

(i) Find full dnfs and cnfs logically equivalent to (A A B) V ->A and 
~,(A =* B) V (~>A A C). 

(ii) Prove that every non-contradictory (non-tautologous) statement 
form PS is logically equivalent to a full dnf (cnf) ( €, and, if PP contains 
exactly n letters, then PA is a tautology (is contradictory) if and only 
if has 2" disjuncts (conjuncts). 

(d) For each of the following, find a logically equivalent dnf (cnf), and then 
find a logically equivalent full dnf (cnf), 


(i) (A V B) A (~>B V C) (iii) (A A ->B) V (A A C) 

(ii) —A V (B =s* —i C) (iv) (A V B) 4P- -r C 

(e) Construct statement forms in -i and A (respectively, in -i and V or in -i 
and =>■) logically equivalent to the statement forms in (d). 


1.43 A statement form is said to be satisfiable if it is true for some as- 
signment of truth values to its statement letters. The problem of determining 
the satisfiability of an arbitrary cnf plays an important role in the theory of 
computational complexity; it is an example of a so-called Jf PS -complete 
problem (see Garey and Johnson, 1978). 

(a) Show that P8 is satisfiable if and only if -\P& is not a tautology. 

(b) Determine whether the following are satisfiable: 

(i) (AVB) A (-A VB V C) A (~<A V -Bv ->C) 

(ii) ((A => B) V C) <=> (-.B A (A V C)) 

(c) Given a disjunction PA of four or more literals: L\ \l Lj V . . . V L „ , let 
Cl, . . . , C„_ 2 be statement letters that do not occur in PA, and construct 
the cnf S \ 


(L\ V Z .2 V Cj) A (~iCi V X 3 V C 2 ) A (“ 1 C 2 VL 4 VC 3 ) A... 

A (->C „_3 V V C„_ 2 ) A (~>C „_2 V L n V - 1 C 1 ) 

Show that any truth assignment satisfying PA can be extended to a truth 
assignment satisfying S and, conversely, any truth assignment satisfying 
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$ is an extension of a truth assignment satisfying 98. (This permits the 
reduction of the problem of satisfying cnfs to the corresponding pro- 
blem for cnfs with each conjunct containing at most three literals.) 

(d) For a disjunction of three literals L\ V L2 V £3, show that a form that 
has the properties of $ in (c) cannot be constructed, with $ a cnf in 
which each conjunct contains at most two literals (R. Co wen). 


1.44 {Resolution) Let 98 be a cnf and let C be a statement letter. If C is a 
disjunct of a disjunction 98 \ in 98 and ->C is a disjunct of another disjunction- 
@2 in 98, then a non-empty disjunction obtained by eliminating C from Sj 
and -i C from 98>2 and forming the disjunction of the remaining literals 
(dropping repetitions) is said to be obtained from 98 by resolution on C. Foil 
example, if 38 is 

{AV~>CV ~^B) A (~iA VD V -1. B ) A (C V Z> V A), 

the first and third conjuncts yield A v ~<B V D by resolution on C. In addi- 
tion, the first and second conjuncts yield ->C v -*B V D by resolution on A, 
and the second and third conjuncts yield D V -1 B V C by resolution on A. If 
we conjoin to 38 any new disjunctions obtained by resolution on all vari- 
ables, and if we apply the same procedure to the new cnf and keep on 
iterating this operation, the process must eventually stop, and the final result 
is denoted 98e^{98). In the example, 9?.£d{98) is: 

(A V ->C V ->B) A (~>A VZ) V-iB) A (C VD VA) A (~>C V ->B V D) 

A (D V ->B V C) A (A V ~>B V D) A (D V ->B) 


(Notice that we have not been careful about specifying the order in which 
conjuncts or disjuncts are written, since any two arrangements will be lo- 
gically equivalent.) 

(a) Find when 98 is each of the following: 

(i) {A V ~*B) A B 

(ii) (A V B v C) A (A V - 1 B V C) 

(iii) (A VC) A (-A v B) A (A V ->C) A (~>A V ->B) 

(b) Show that 98 logically implies 3$£d{38). 

(c) If 98 is a cnf, let 38c be the cnf obtained from 98 by deleting those 
conjuncts that contain C or -1 C. Let t'c{98) be the cnf that is the con- 
junction of 98 c and all those disjunctions obtained from 98 by resolution 
on C. For example, if 98 is the cnf in the example above, then f'c{98) is 
(-1 A vDv ~>B) A (A v ~>B V D ) . Prove that, if r c {98) is satisfiable, then so 
is 98. (R. Co wen) 

(d) A cnf 98 is said to be a blatant contradiction if it contains some letter C 
and its negation ~>C an conjuncts. An example of a blatant contradiction 
is {A v B) A B A (C V D) A ->B. Prove that if 98 is unsatisfiable, then 
98.£c{98) is a blatant contradiction. [Hint: Use induction on the number n 
of letters that occur in 38. In the induction step, use (c).] 
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(e) Prove that 0& is unsatisfiable if and only if Meo{$) is a blatant contra- 
diction. 


1 45 Let $ and Q) be statement forms such that =>• 2 is a tautology. 

j If $ and Q> have no statement letters in common, show that either 38 is 
contradictory or ^ is a tautology. 

(b) (Craig's interpolation theorem ) If 38 and Q) have the statement letters 
B\ , B„ in common, prove that there is a statement form having 
g . ? B„ as its only statement letters such that @1 m and ^ f A are 

tautologies. 

fc) Solve the special case of (b) in which 38 is (B x =>- A) A (A B 2 ) and Q) is 
(B\ A C) =>- (B 2 A C ). 

1.46 

/ a » a certain country is inhabited only by truth-tellers (people who always 
tell the truth) and liars (people who always lie). Moreover, the in- 
habitants will respond only to yes or no questions. A tourist comes to a 
fork in a road where one branch leads to the capital and the other does 
not. There is no sign indicating which branch to take, but there is a 
native standing at the fork. What yes or no question should the tourist 
ask in order to determine which branch to take ? [Hint: Let A stand for 
‘You are a truth-teller’ and let B stand for ‘The left-hand branch leads to 
the capital’. Construct, by means of a suitable truth table, a statement 
form involving A and B such that the native’s answer to the question as 
to whether this statement form is true will be yes when and only when B 
is true.] 

(b) In a certain country, there are three kinds of people: workers (who 
always tell the truth), businessmen (who always lie), and students (who 
sometimes tell the truth and sometimes lie). At a fork in the road, one 
branch leads to the capital. A worker, a businessman and a student are 
standing at the side of the road but are not identifiable in any obvious 
way. By asking two yes or no questions, find out which fork leads to the 
capital (Each question may be addressed to any of the three.) 



More puzzles of this kind may be found in Smullyan (1978, chap. 3; 1985, 
^chaps 2, 4-8). 


L4 AN AXIOM SYSTEM FOR THE PROPOSITIONAL CALCULUS 

Truth tables enable us to answer many of the significant questions con- 
cerning the truth-functional connectives, such as whether a given statement 
form is a tautology, is contradictory, or neither, and whether it logically 
implies or is logically equivalent to some other given statement form. The 
more complex parts of logic we shall treat later cannot be handled by truth 
tables or by any other similar effective procedure. Consequently, another 
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approach, by means of formal axiomatic theories, will have to be tried. 
Although, as we have seen, the propositional calculus surrenders completely 
to the truth table method, it will be instructive to illustrate the axiomatic 
method in this simple branch of logic. 

A formal theory df is defined when the following conditions are satisfied: 

1. A countable set of symbols is given as the symbols of ddK A finite se- 
quence of symbols of df is called an expression of df. 

2. There is a subset of the set of expressions of df called the set of well- 
formed formulas (wfs) of df. There is usually an effective procedure to 
determine whether a given expression is a wf. 

3. There is a set of wfs called the set of axioms of df. Most often, one can 
effectively decide whether a given wf is an axiom; in such a case, df is 
called an axiomatic theory. 

4. There is a finite set R\ , of relations among wfs, called rules of 

inference. For each Rj, there is a unique positive integer j such that, for 
every set of j wfs and each wf dd, one can effectively decide whether the 
given j wfs are in the relation Rj to dd, and, if so, dd is said to follow from 
or to be a direct consequence of the given wfs by virtue of /q. 

A proof in df is a sequence dd\, . . . , ddu of wfs such that, for each i, either 
ddj is an axiom of df or ddj is a direct consequence of some of the preceding 
wfs in the sequence by virtue of one of the rules of inference of df. 

A theorem of df is a wf dd of df such that dd is the last wf of some proof in 
df. Such a proof is called a proof of dd in df. 

Even if df is axiomatic - that is, if there is an effective procedure for 
checking any given wf to see whether it is an axiom - the notion of ‘theorem’ 
is not necessarily effective since, in general, there is no effective procedure 
for determining, given any wf dd, whether there is a proof of dd. A theory for 
which there is such an effective procedure is said to be decidable ; otherwise, 
the theory is said to be undeciclable. 

From an intuitive standpoint, a decidable theory is one for which a 
machine can be devised to test wfs for theoremhood, whereas, for an un- 
decidable theory, ingenuity is required to determine whether wfs are theo- 
rems. 

A wf ^ is said to be a consequence in dd of a set of T of wfs if and only if 
there is a sequence dd\, . . . , dd^ of wfs such that ( f is ddk and, for each /, 
either ddj is an axiom or ddj is in T, or ddj is a direct consequence by some rule 


t These ‘symbols’ may be thought of as arbitrary objects rather than just lin- 
guistic objects. This will become absolutely necessary when we deal with theories 
with uncountably many symbols in Section 2.12. 

* An example of a rule of inference will be the rule modus ponens (MP): follows 
from dd and dd =>- ( f. According to our precise definition, this rule is the relation 
consisting of all ordered triples { Pd, Pd f, f ), where Pd and f> are arbitrary wfs 
of the formal system. 
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if inference of some of the preceding wfs in the sequence. Such a sequence is 
called a proof (or deduction ) of from F. The members of Y are called the 
hypotheses or premisses of the proof. We use Y b # as an abbreviation for <c € 
is a consequence of T\ In order to avoid confusion when dealing with more 
than one theory, we write T V<? adding the subscript to indicate the 
stheory in question. 

[f F is a finite set . . . , 33f m }, we write Jfi, . . . , b <3 instead of 

r r j ' ^ b ^f. If r is the empty set 0, then 0 b # if and only if ^ is a 

- theorem. It is customary to omit the sign ‘0’ and simply write 1- c i. Thus, b 
is another way of asserting that is a theorem. 

The following are simple properties of the notion of consequence: 


I if F C A and Y 1- ( £, then A b <€. 

n r i- (g if and only if there is a finite subset A of Y such that A b ( A. 

3 If A b c 3, and for each 33 in A, Y \- 38, then Y b ( i. 

Assertion 1 represents the fact that if ( £ is provable from a set Y of pre- 

misses, then, if we add still more premisses, ^ is still provable. Half of 2 
follows from 1 . The other half is obvious when we notice that any proof of # 
from F uses only a finite number of premisses from F. Proposition 3 is also 
quite simple: if # is provable from premisses in A, and each premiss in A is 
provable from premisses in F, then ( £ is provable from premisses in F. 

We now introduce a formal axiomatic theory L for the propositional 
calculus. 

1 . The symbols of L are =*►, (, ), and the letters A t with positive integers 
i as subscripts: A i, Aj, A 3 , .... The symbols -> and =>- are called pri- 
mitive connectives, and the letters A,- are called statement letters. 

2. (a) All statement letters are wfs. 

(b) If 33 and ( A are wfs, then so are {->33) and {33 =$■ r €)3 

Thus, a wf of L is just a statement form built up from the statement 
letters A,- by means of the connectives -1 and =>. 

3 . If 33, & and 3> are wfs of L, then the following are axioms of L: 

(Al) {33 => (V => 33)) 

(A2) {{33 => if€ =*► 9)) => {{33 => <i) =* {33 =* 9))) 

(A3) (((-ft) =*■ (-#)) =* (((-ft) =* BS) ^ <t)) 

4. The only rule of inference of L is modus ponens : ( € is a direct con- 
sequence of 33 and {33 => < 6 f ). We shall abbreviate applications of this 
rule by MP.* 


We shall use our conventions for eliminating parentheses. 


'To be precise, we should add the so called extremal clause: (c) An expression is 
a wf if and only if it can be shown to be a wf on the basis of clauses (a) and (b). This 
can be made rigorous using as a model the definition in footnote § on page 13. 

*A common English synonym for modus ponens is the detachment rule. 
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Notice that the infinite set of axioms of L is given by means of three 
axiom schemas (A1)-(A3), with each schema standing for an infinite number 
of axioms. One can easily check for any given wf whether or not it is an 
axiom; therefore, L is axiomatic. In setting up the system L, it is our in- 
tention to obtain as theorems precisely the class of all tautologies. 

We introduce other connectives by definition: 


(Dl) {@ A V) for --(# =* ~fi) 

(Dl) V <#) for (-.#) =* 

(D3) \m <g) for (® =* <g) A (# =* ®) 

The meaning of (Dl), for example, is that, for any wfs ® and 'A, \® A f A)’ is 
an abbreviation for => . 


LEMMA 1.8 for all wfs M. 



Proof' 

We shall construct a proof in L of ® 

1. ((® => ®)) => 

((® => \® =* ®]) ®)) 

2. m => ((& 

3. ( ® => ( ® ~y -> ( ® =y M) 

4. ® =}► (® => ®) 


Instance of axiom schema (A2) 

Axiom schema (Al) 

From 1 and 2 by MP 
Axiom schema (Al) 


tThe word ‘proof’ is used in two distinct senses. First, it has a precise meaning 
defined above as a certain kind of finite sequence of wfs of L. However, in another 
sense, it also designates certain sequences of the English language (supplemented by 
various technical terms) that are supposed to serve as an argument justifying some 
assertion about the language L (or other formal theories). In general, the language 
we are studying (in this case, L) is called the object language, while the language in 
which we formulate and prove statements about the object language is called the 
metalanguage. The metalanguage might also be formalized and made the subject of 
study, which we would carry out in a metametalanguage, and so on. However, we 
shall use the English language as our (unformalized) metalanguage, although, for a 
substantial part of this book, we use only a mathematically weak portion of the 
English language. The contrast between object language and metalanguage is also 
present in the study of a foreign language; for example, in a Sanskrit class, Sanskrit is 
the object language, while the metalanguage, the language we use, is English. The 
distinction between proof and metaproof (i.e., a proof in the metalanguage) leads to a 
distinction between theorems of the object language and metatheorems of the me- 
talanguage. To avoid confusion, we generally use ‘proposition’ instead of ‘me- 
tatheorem’. The word ‘metamathematics’ refers to the study of logical and 
mathematical object languages; sometimes the word is restricted to those investiga- 
tions that use what appear to the metamathematician to be constructive (or so-called 
finitary) methods. 
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From 3 and 4 by MP* 



Exercise 
1.47 Prove: 

(a) El ^ ^ ^ 

(b) '3 => m ® 

M # => (tf =S- ®) I- L « =>• (® =*■ S) 

ffrl-L (-« =* -^) =* (# =* «) 

In mathematical arguments, one often proves a statement on the as- 
sumption of some other statement 3ft and then concludes that ‘if 38, then < E” 
Is true. This procedure is justified for the system L by the following theorem. 


PROPOSITION 1.9 (DEDUCTION THEOREM)* 

#C is a set of wfs and 3ft and are wfs, and T, ^ h r E, then r I- 3ft => ( 3. In 
particular, if 3ft E #, then E 3ft ^ (Herbrand, 1930). 


Let #j, . . . , be a proof of form Tu{^}, where r E„ is < E. Let us prove, by 
induction on j, that Tl-f for 1 < j^n. First of all, ( €\ must be either 
in T or an axiom of L or 3ft itself. By axiom schema (Al), < E ] =$■ {3ft => 'Ei) is 
ah axiom. Hence, in the first two cases, by MP, T E 3ft => 'Ej. For the third 
ease, when 'Ej is 3ft, we have 3ft => c 3\ by Lemma 1.8, and, therefore, 
r E This takes care of the case y = l. Assmne now that 

E E 3ft => for all k < j. Either 'Ey is an axiom, or 'Ey is in T, or 'Ey is 3ft, or 

%j follows by modus ponens from some 'Eg and If,,,, where l < j, in < j, and 
Y; ;/ f has the form ^ => 'Ey. In the first three cases, F E 3ft => 'Ey as in the case 
/— 1 above. In the last case, we have, by inductive hypothesis, T E 3ft =$■ 'Ef 
and r E 3ft => (*$( =s> 'Ey). But, by axiom schema (A2), E (3ft => ('Ef =s> 'Ey)) 
:> ((3ft =3 <g t ) ^(3ft^> 'Ey)). Hence, by MP, r E (3ft =J> <g e ) => (3ft =* 'Ey), 
and, again by MP, rE^^ 'Ey. Thus, the proof by induction is complete. 
The case j = n is the desired result. [Notice that, given a deduction of *E from 
sF and 3ft , the proof just given enables us to construct a deduction of 3ft =$■ 


^The reader should not be discouraged by the apparently unmotivated step 1 of 
The proof. As in most proofs, we actually begin with the desired result, 38 3ft, and 
then look for an appropriate axiom that may lead by MP to that result. A mixture of 
ingenuity and experimentation leads to a suitable instance of axiom (A2). 

•For the remainder of the chapter, unless something is said to the contrary, we 
shall omit the subscript L in \- L . In addition, we shall use T, 3ft E <E to stand for 
Eu{.^} E <E. In general, we let T, 3ft \ , . . . , 3ft „ E stand for F u{i#] , . . . , 3ft „} E 
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from r. Also note that axiom schema (A3) was not used in proving the 
deduction theorem.] 


COROLLARY 1.10 


(a) m =*► <£ =* 3) b A 

(b) & =* (# =* <€ b 


Proof 

For part (a): 


1. m 

2 . <€ 

3. m 

4. <€ 

5. ^ 


Hyp (abbreviation for ‘hypothesis’) 

Hyp 

Hyp 

1, 3, MP 

2, 4, MP 


Thus, & # , # => b So, by the deduction theorem, 

@ 3 } ^ < 3 . 

To prove (b), use the deduction theorem. 


LEMMA 1.11 


For any wfs & and ( £, the following wfs are theorems of L. 


(a) =* m 

(e) (# => tf) =*► (-.# => -.#) 

(b) m ^ ->-& 

(f ) m => (-& => =* #)) 

(c) -^9 => => <i) 

(d) (-.# =* -.#) 

(g) (^ => <g) => ((-.^ => ^) 

Proof 


(a) b -.-^9 => &) 


i. {-ids =* -r-a) => ((-«# => 

- -i^?) =>. ^) Axiom (A3) 

2. -& => 

Lemma 1.8 t 

3. ( — =>• "'—'3}) 

1 , 2, Corollary 1 . 1 0(b) 

4. -.-^9 => (-^9 => -.--^9) 

Axiom (Al) 

5. => m 

3, 4, Corollary 1.10(a) 


% 


1 Instead of writing a complete proof of we simply cite Lemma 1.8. 

In this way, we indicate how the proof of ->-i$ => $ could be written if we wished to 
take the time and space to do so. This is, of course, nothing more than the ordinary 
application of previously proved theorems. 
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h 38 =4> 


1. (-r^® ->&) => 

Axiom (A3) 

((-nn^ =4> -i-^9) 


2, — in^J 1 => 

Part (a) 

3 . =4 >38)=* ->->38 

1, 2, MP 

4. 38 (->->->38 => 38) 

Axiom (Al) 

5. 38 => 

3, 4, Corollary 1.10(a) 

b (# => < 3 ) 


1. ^38 

Hyp 

2. 38 

Hyp 

3. •# =*► (“^ =* 

Axiom (Al) 

4. =» (~33 => -•#) 

Axiom (Al) 

5. => ^ 

2, 3, MP 

6 . =4> 

1, 4, MP 

7. (-# =4> -i^) => ((-^ =4> =4> tf) 

Axiom (A3) 

8. (-,# =4> #) =4> 

6, 7, MP 

9. ^ 

5, 8, MP 

10. -i 38, 38 b^ 

1-9 

11. -i^b ^ =4> # 

1 0, Deduction theorem 

12. b -t38 =^(38=>^) 

11, Deduction theorem 

b (-33 =4- -^) =* =4- tf) 


1. -fig=4>~& 

Hyp 

2. (-,# ^ -.^) =4> ((-^ =>38)=><3) 

Axiom (A3) 

3. # =4> (-^ =* 

Axiom (Al) 

4. (-.^ =4> 0) =4> 

1, 2, MP 

5. 38 ^<3 

3, 4, Corollary 1.10(a) 

6. — fll> => ~ I - ^ ^ 

1-5 

7. b (-^ =4, -^9) =4» =4> 

6, deduction theorem 

b =4> #) =4 (-.# =4 ^38) 


1 . 38 ^<3 

Hyp 

2. =4 38 

Part (a) 

3. =4- <g 

1 , 2, Corollary 1 . 1 0(a) 

4. -r-fig 

Part (b) 

5. =4 -.-i^ 

3, 4, Corollary 1.10(a) 

6. (-n# =4> -r-fif) =4 (-33 =4 -48) 

Part (d) 

7. -.^ =4 -># 

5, 6, MP 

8. 38 ^ <£ V -fg ^ ->38 

1-7 

9. b (# =4 <g) =4 (-fig =4 -.<#) 

8, deduction theorem 


((#=*#)=*<?)) by 


(0 b &=>(-&=> -.(a => &)). 

Clearly, 38, 38 => ^ 1- <3 by MP. Hence, b 
two uses of the deduction theorem. Now, by (e), b ((38 =4* f 3) 
=><3)=> (-fig => -.(# =i> #)). Hence, by Corollary 1.10(a), 

b 38 ^ (-,#=*-,(<#=*#)). 

(g) b (38 <g) ^ ((-.^ =4> <g) => <g) 
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2. -t# => 

3. (# => <€) =* (-f€ =* -.0) 

4. = :> -i^ 

5. =4>^) (-i^ =* -i^) 

6. -.tf =* -.-.# 

7. (-I<5f =* -.-^P) => ((-^ => -n^) =* tf) 

8. (-*? =* -<0) => # 

9. ^ 

10. a => <&, -& => v h <# 

11. m =># h (-& =*<#)=><# 

12. h (0S =>- <i) ^ ((-^ => #) => 


Hyp 
Hyp 
Part (e) 

1, 3, MP 
Part (e) 

2, 5, MP 
Axiom (A3) 

6, 7, MP 

4, 8, MP 
1-9 

10, deduction theorem 

11, deduction theorem 


Exercises 


1.48 Show that the following wfs are theorems of L. 


(a) m =* {08 V <6) 

(b) m => (# v 

(c) # V # => .# V # 

(d) mN^^m 


(e) 0 A => 

(f) =* 0) => ((# =* 0) ^ (# V ^ =* 0)) 

(g) ({£ 

(h) m =* (# => {& a «’)) 


1.49 Exhibit a complete proof in L of Lemma 1.11(c). [Hint: Apply the 
procedure used in the proof of the deduction theorem to the demonstration 
given earlier of Lemma 1.11(c).] Greater fondness for the deduction theorem 
will result if the reader tries to prove all of Lemma 1.11 without using the 
deduction theorem. 


It is our purpose to show that a wf of L is a theorem of L if and only if it 
is a tautology. Half of this is very easy. 


PROPOSITION 1.12 

Every theorem of L is a tautology. 


Proof 

As an exercise, verify that all the axioms of L are tautologies. By Proposi- 
tion 1.2, modus ponens leads from tautologies to other tautologies. Hence, 
every theorem of L is a tautology. 

The following lemma is to be used in the proof that every tautology is a 
theorem of L. 
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LEMMA 1.13 

I el $ be a wf and let 5], ...,£* be the statement letters that occur in 28. 
For a giveiFassignment of truth values to B\, . . . , B k , let Bj be Bj if Bj takes 
the value T; and let B' J be ->Bj if Bj takes the value F. Let 28 1 be if 28 takes 
the value T under the assignment, and let 28 1 be ->2t if 28 takes the value F. 

Then B‘ v • • • > ^ • 

For example, let 28 be — <(— <^2 => As). Then for each row of the truth table 


Ai 

T 

F 

T 

F 


As 

T 

T 

F 

F 


■(->^2 =4- As ) 


F 

F 

F 

T 


Lemma 1.13 asserts a corresponding deducibility relation. For instance, 
corresponding to the third row there is A 2} ~>A 5 I — >— ■(— *^2 A 5 ), and to the 
fourth row, ~Ai, ~^A 5 1 >(->A 2 As). 


Proof 

The proof is by induction on the number n of occurrences of -i and =$- in 28. 
(We assume 28 written without abbreviations.) If n = 0,28 is just a statement 
fetter B\, and then the lemma reduces to B\ h B\ and ~>B\ I — >B\. Assume 
now that the lemma holds for all j < n. 

Case 1. 28 is ~> ( A. Then ^ has fewer than n occurrences of -> and =^. 

Subcase la. Let # take the value T under the given truth value assign- 
ment. Then 28 takes the value F. So, ( 2’ is ( £ and 28' is ~>28. By the inductive 
hypothesis applied to ( f>, we have B\, ... ,B k \- ( g. Then, by Lemma 1.11(b) 
and MP, B\,...,B’ k V ^-fg. But -,-fg is 28'. 

Subcase lb. Let # take the value F. Then 28 takes the value T. So, is -fg 
and 2# is 28, By inductive hypothesis, B\ , ..., B’ k I — > ( i. But -fg is 2# . 

Case 2. 28 is 2? => 28. Then ( g and (A have fewer occurrences of -> and => 
than 28. So, by inductive hypothesis, B \ , . . . , B k h ( g' and B\, . . . , B' k F Of . 

Subcase 2a. <2 takes the value F. Then 28 takes the value T. So, is -i # 
and 2$ is 28. Hence, B[, ... , B k \ — i ( g. By Lemma 1. 1 1 (c) and MP, 
#!, , . . , B' k F <g => But <i => ® is 

Subcase 2b. 22 takes the value T. Then 28 takes the value T. So, 28' is 28 
and 28/ is 28. Hence, B\, , . . , B' k \- 28. Then, by axiom (Al) and MP, 
%, . . . , B' F <g But <g =* 0 is 28'. 


Subcase 2c. c £ takes the value T and 28 takes the value F. Then 28 takes 
the value F. So, f g' is < g, 28' is -i 28, and 28/ is ->28. Therefore, B\, . . . , B k \- <g 


and B\ . ..., B[.\ — >28. Hence, by Lemma 1 . 1 1 (f) and MP, B[, . . . , B’ k 


b - ' • 3 

(<g => ®), But 


(<g =* 28) is 


■ * * 
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If a wf 3d of L is a tautology, then it is a theorem of L. 

Proof 

(Kalmar, 1935) Assume 3d is a tautology, and let B\, ... , B k be the state- 
ment letters in 3d. For any truth value assignment to B\, . . . , B k , we have, by: 
Lemma 1.13, B \ , . . . , Bj,. h 3d. {3d 1 is 3d because 3d always takes the value T.) 
Hence, when B' k is given the value T, we obtain B \ , . . . , B' k .j , B k F 3d, and, 
when B k is given the value F, we obtain B \ , . . . , B' k _ l , -> B k F 3d. So, by the . 
deduction theorem, B \ , . . . , B' k __y F B k =^> 3d and B[, . . . , B' k _ x 1 — <B k => 3d. 
Then by Lemma 1.1 1(g) and MP, B[, . . . , B' k l F 3d. Similarly, B k ^\ may be 
chosen to be T or F and, again applying the deduction theorem. Lemma 
1 . 1 1(g) and MP, we can eliminate B' k _ { just as we eliminated B' k . After k such 
steps, we finally obtain F 3d. 

COROLLARY 1.15 

If ^ is an expression involving the signs -i, =^, A, V and that is an 
abbreviation for a wf 3d of L, then ^ is a tautology if and only if 3d is a 
theorem of L. 

Proof 

In definitions (Dl) (D3), the abbreviating formulas replace wfs to which 
they are logically equivalent. Hence, by Proposition 1.4 ■ 3d and ^ are lo- 
gically equivalent, and ^ is a tautology if and only if 3d is a tautology. The 
corollary now follows from Propositions 1.12 and 1.14. 

COROLLARY 1.16 

The system L is consistent; that is, there is no wf 3d such that both 3d and 
->3d are theorems of L. 

Proof 

By Proposition 1.12, every theorem of L is a tautology. The negation of a 
tautology cannot be a tautology and, therefore, it is impossible for both 3d 
and ^3d to be theorems of L. 

Notice that L is consistent if and only if not all wfs of L are theorems. In 
fact, if L is consistent, then there are wfs that are not theorems (e.g., the 
negations of theorems). On the other hand, by Lemma 1.11(c), 
F \_~ i 3d [3d => < 3), and so, if L were inconsistent, that is, if some wf 3d and 
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PROPOSITION 1.14 (COMPLETENESS THEOREM) 




INDEPENDENCE. MANY- VALUED LOGICS 



■ negation were provable, then by MP any wf Y would be provable. 
(This equivalence holds for any theory that has modus ponens as a rule of 
inference and in which Lemma 1 . 1 1 (c) is provable.) A theory in which not all 
vfs are theorems is said to be absolutely consistent , and this definition is 
applicable even to theories that do not contain a negation sign. 


Exercise 


4 i *>0 Let 38 be a statement form that is not a tautology. Let L + be the formal 
theory obtained from L by adding as new axioms all wfs obtainable from 38 
hv substituting arbitrary statement forms for the statement letters in 38, with 
.|Y same form being substituted for all occurrences of a statement letter. 
‘Show that L + is inconsistent. 


li5 INDEPENDENCE. MANY-VALUED LOGICS 

t 

A subset Y of the set of axioms of a theory is said to be independent if some 
wf-in Y cannot be proved by means of the rules of inference from the set of 
(hose axioms not in Y . 


PROPOSITION 1.17 

Each of the axiom schemes (A1)-(A3) is independent. 


Proof 

To prove the independence of axiom schema (AI), consider the following 
tables: 

A —A A B A =>- B 

0 1 0 0 0 

11 10 2 

2 0 2 0 0 

0 1 2 

1 1 2 

2 1 0 

0 2 2 

1 2 0 

2 2 0 

For any assignment of the values 0, 1 and 2 to the statement letters of a wf 
38, these tables determine a corresponding value of 38. If 38 always takes the 
value 0, 38 is called select. Modus ponens preserves selectness, since it is easy 
to check that, if 38 and 38 =4- # are select, so is ( 3. One can also verify that all 
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instances of axiom schemas (A 2 ) and (A 3 ) are select. Hence, any wf deni 
vable from (A2) and (A 3 ) by modus ponens is select. However, 
A i => ( A2 => Ay), which is an instance of (Al), is not select, since it takes the 
value 2 when A\ is 1 and A2 is 2 . 

To prove the independence of axiom schema (A 2 ), consider the following 
tables: 


A — 1 A ABA =■> B 

0 1 0 0 0 

10 10 0 

2 1 2 0 0 

0 1 2 

1 1 2 

2 1 0 

0 2 1 

1 2 0 

2 2 0 

Let us call a wf that always takes the value 0 according to these tables: 
grotesque. Modus ponens preserves grotesqueness and it is easy to verily 
that all instances of (Al) and (A 3 ) are grotesque. However, the instance 
(Ay =>- (A2 => A 3)) =>• (( A 1 => A2) => ( A 1 =^ 3 )) of (A 2 ) takes the value 2 
when A 1 is 0, A2 is 0, and A 3 is 1 and, therefore, is not grotesque. 

The following argument proves the independence of (A 3 ). Let us call a wf 
dft super if the wf h( 0 ) obtained by erasing all negation signs in & is a 
tautology. Each instance of axiom schemas (Al) and (A2) is super. Also, 
modus ponens preserves the property of being super; for if h(& %>) and 

h( 38 ) are tautologies, then /?(•?) is a tautology. (Just note that h{& => ( d) is 
h( 3 $) => h(%>) and use Proposition 1 . 2 .) Hence, every wf (A derivable from 
(Al) and (A 2 ) by modus ponens is super. But h{{~>Ai => ~A]) =$ 
{(-‘Ai => A{) =>- Ay)) is (Ay =>- Ay) => ((Ay => Ay) => Ay), which is not a 
tautology. Therefore, (-‘Ay => ->A 1) => ((-‘Ay =$■ Ay) ^ Ay), an instance of 
(A 3 ), is not super and is thereby not derivable from (Al) and (A 2 ) by modus 
ponens. 

The idea used in the proof of the independence of axiom schemas (Al) 
and (A 2 ) may be generalized to the notion of a many-valued logic. Select a 
positive integer n, call the numbers 0, 1 , truth values , and choose a 

number m such that 0 ^m < n. The numbers 0 , 1 , . . . , m are called desig- 
nated values. Take a finite number of ‘truth tables’ representing functions 
from sets of the form ( 0 , 1 , . . . , n} k into ( 0 , 1 , . . . , n}. For each truth table, 
introduces a sign, called the corresponding connective. Using these con- 
nectives and statement letters, we may construct ‘statement forms’, and 
every such statement form containing j distinct letters determines a ‘truth 
function’ from ( 0 , 1, . . . , n} J into { 0 , 1 , . . . , n}. A statement form whose 
corresponding truth function takes only designated values is said to be 
exceptional. The numbers m and n and the basic truth tables are said to 
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defiii e a (fi n i te ) many-valued logic M. A formal theory involving statement 
letters and the connectives of M is said to be suitable for M if and only if the 
theorems of the theory coincide with the exceptional statement forms of M. 
All these potions obviously can be generalized to the case of an infinite 
number of truth values. If n — l and m = 0 and the truth tables are those 
given f or and in section 1.1, then the corresponding two-valued logic is 
that studied in this chapter. The exceptional wfs in this case were called 
tautologies. The system L is suitable for this logic, as proved in Propositions 
1 12 and 1.14. In the proofs of the independence of axiom schemas (Al) and 
(A2), two three-valued logics were used. 


45 


Exercises 

j 5 j prove the independence of axiom schema (A3) by constructing ap- 
propriate ‘truth tables’ for and 

I 52 (McKinsey and Tarski, 1948) Consider the axiomatic theory P in which 
there is exactly one binary connective *, the only rule of inference is modus 
ponens (that is, W follows from & and 08 * (f), and the axioms are all wfs of 
the form Show that P is not suitable for any (finite) many- valued 

logic. 

1.53 For any (finite) many-valued logic M, prove that there is an axiomatic 
thedry suitable for M. 

Further information about many-valued logics can be found in Rosser 
and Turquette (1952), Rescher (1969), Bole and Borowik (1992) and Mal- 
inowski (1993). 


1.6 OTHER AXIOMATIZATIONS 

Although the axiom system L is quite simple, there are many other systems 
ithat would do as well. We can use, instead of -> and =>, any collection of 
primitive connectives as long as these are adequate for the definition of all 
jptjher truth-functional connectives. 


gkf: V and -i are the primitive conectives. We use 08 =>• # as an abbreviation 
for V c €. We have four axiom schemas: (1) 01 V 01 (2) 

SI =► 01 V <€\ (3) 88 V =* <e V 8B\ and (4) (# =» 9) =* {88 V <8 =* 
(Si V S>). The only rule of inference is modus ponens. Here and below we 
use the usual rules for eliminating parentheses. This system is developed 
in Hilbert and Acker mann (1950). 

1-2 : A and -i are the primitive connectives. 88 => is an abbreviation for 
~*{08 There are three axiom schemas: (1) 81 => {01 A (28); 
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(2) 3ft A ^ => 3ft; and (3) {3ft (~ , (^ A^) => -i(^ A ^)). Modus 

ponens is the only rule of inference. Consult Rosser (1953) for a detailed 
study. 

L3: This is just like our original system L except that, instead of the axiom 
schemas (Al) (A3), we have three specific axioms: (1) A\ (A2 => A\); 
(2) (A } (A 2 A 3 )) {{Aj => A 2 ) => (A 1 A 2 )); and (3) (-1 A 2 

-1A1) ((-<A 2 => A\) A 2 ). In addition to modus ponens, we have a 

substitution rule: we may substitute any wf for all occurrences of a 
statement letter in a given wf. 

L 4 : The primitive connectives are =>, A, V and Modus ponens is the only 
rule, and we have ten axiom schemas: (1) 3ft => {<& => 3ft); (2) {3ft ==>. 
(^ => 9)) -A {{3ft ^ <#) => (^ =» 0)); (3) ^ A ^ (4) 3ft A ^ => <€\ 

(5) ® =* (tf =-, (^ A #)); (6) ^ =>- V #); (7) (^ V tf); (8) {3ft 4 

9) => ((# 0) ^ { 3 ft v ^ =* 0)); (9) (^ =* ^) ((^ =» -.#) ^ -.#); 

and (10) ->-i38 This system is discussed in Kleene (1952). 


Axiomatizations can be found for the propositional calculus that contain 
only one axiom schema. For example, if -1 and are the primitive con- 
nectives and modus ponens the only rule of inference, then the axiom 
schema 


[{{{3ft =>• «*) (-.^ ->$)) => 0 ) =>&-]=> [(jf 3ft) ^ (s ^ 3ft)] 

is sufficient (Meredith, 1953). Another single-axiom formulation, due to 
Nicod (1917), uses only alternative denial | . Its rule of inference is: 9 
follows from 3ft \ {<# \ 9) and 3ft, and its axiom schema is 

m«m 1 mw)\ i 

Further information, including historical background, may be found in 
Church ( 1 956) and in a paper by Lukasiewicz and Tarski in Tarski ( 1 956, IV). 


Exercises 


1.54 (Hilbert and Ackermann, 1950) Prove the following results about the 
theory Li. 

(a) 3ft =><g\- Li 9\ 3ft ^ 9v ( £ 

(b) F L) {3ft => <<f) =* (( 3) =* 3ft) =>{9^ <&)) 

(c) 9^ 3ft, 0 ft => <€ F u 9 =* 

(d) F L\3ft =4>- 3ft (i.e., F Lj ~~y0ft V 3ft) 

(e) F u 3ft\j^3ft 

(f) \- u 3ft=>^3ft 

(g) F u -& ^{3ft=>V) 

(h) u 3ft V {<3 V 9) => {{<£ \/ {3ft \j 9)) V 3ft) 

(i) ^ U {^\J {3ft\J 9))y 3ft {3ft\j 9) 

(j) F u 3ft\J {<3\/9) =* <£\!{3ft V9) 






/k) h Li ^ ^ ^)) ^ ^ ^ ^)) 

m h‘L,(^ =» ^) =* ((# =*• ^) =* (^ =*■ #)) 

/m \^ =» (^ => ^), ^ =S> ^b u^ ^ {& =^ ^) 

(0) If r, u^’ then r I- Ll 38 =» # (deduction theorem) 

(p) # =» <$, -,^ => 3ft b L) # 

(q) b l, # if and onl y if ® is a tautology. 

1 5^ (Rosser, 1953) Prove the following facts about the theory L 2 . 

(a) 3ft =* ^ ^ b A ^) 

(b) b l 2 ~ ' (~ 1 A -^) 

(C) b L 2 ~'~~ , ^ ) ^ 

^d) b L 2 — '(-^ A *&’) =b (^ =4^ —| ^) 

(e) b l 2 ^ =* 

(f) b l 2 (.^ =► #) =» ("^ =*• 

(g) - 1 ^ =*• ~i# b 1.2 r A 38 

(h) 

|i) 3ft => <3, ^ ^ 38, 38 & u 38 => <f 

Jj) b t2 ^^^ 

(lc) b La^* A ^ ^ ^ f\ 3ft 

(1) 

(n) ^ =*• ^ b Lz 0 A <$ =$■ 3ft A 0 

(o) b l : (^ =* (tf =* &)) =* (0» A <g) ^ 38) 

(p) b ul(a a #) =» 0 ) => {08 => (# =» ^)) 

(q) # =* 

(r) b L2 ^=> A^) 

(S) b l 2 ^ => {<3 => 38) 

(t) If r, 38 b u <$, then rb L ^4^ (deduction theorem) 

(u) b L 2 (~ =>- 38) => 38 

(v) 3ft ^ c 3, - 1 ^ =£■ ^ b l 2 ^ 

(w) b l 2 38 if and only if 38 is a tautology. 

1.56 Show that the theory L 3 has the same theorems as the theory L. 

1.57 (Kleene, 1952) Derive the following facts about the theory L 4 . 

(a) b La& =£- 38 

(b) If F, 38 b L/) ^, then F b La 38 => <3 (deduction theorem) 

(c) 38 => <3, <3 23 u 3}=> 3ft 

(d) b l 4 {38 ^) => (—> ( 3 = 4 - - 1 38) 

(e) 3ft, - 1 3ft b l 4 ^ 

(f) b L4 ^ => -1 ->38 

(g) b u -& ^{38=><3) 

(h) b \^38 =$> (-M =» ^{38 => tf)) 

(i) b l 4 ~ ^ {~ ,< 3 =>• ~ '(^ V < & 7 )) 

G) b L4 (^=^^)^((^^^)^^) 

(k) b l 4 ^ if and only if 38 is a tautology. 
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1.58° Consider the following axiomatization of the propositional calculus 
<£ (due to Lukasiewicz). has the same wfs as our system L. Its only mj e 
of inference is modus ponens. Its axiom schemas are: 

(a) Aft) =$■ Aft 

(b) Aft c £) 

(c) {m =* <$) =s> ((& => ®) => {as =» ®)) 

Prove that a wf Aft of J!z? is provable in A£ if and only if Aft is a tautology. 
[Hint: Show that L and SA have the same theorems. However, remember 
that none of the results proved about L (such as Propositions 1.8-1.13) 
automatically carries over to ££ . In particular, the deduction theorem is not 
available until it is proved for Aft.] 

1.59 Show that axiom schema (A3) of L can be replaced by the schema 

(->Aft -Aft) {fft Aft) without altering the class of theorems. 

1.60 If axiom schema (10) of L 4 is replaced by the schema (lO)': 

—*ftft ^ [Aft ( ft), then the new system Li is called the mtuitionistic propo- 

sitional calculus. t Prove the following results about Li. 

(a) Consider an (/; + 1 ) -valued logic with these connectives: ->Aft is 0 when Aft 
is n, and otherwise it is n; Aft A & has the maximum of the values of Aft 
and r ft, whereas Aft V *ft has the minimum of these values; and Aft => <ft is 0 
if Aft has a value not less than that of 7?, and otherwise it has the same 
value as ( ft. If we take 0 as the only designated value, all theorems of L] 
are exceptional. 

(b) A 1 V A \ and =^> A\ are not theorems of Lj. 

(c) For any m, the wf 

(A] O A 2 ) V . . . V (A] ^ Am) v (A 2 *>A 3 )V... 

V (A 2 A m ) V ... V (A m -i & Am) 

is not a theorem of Lj 

(d) (Godel, 1933) Li is not suitable for any finite many- valued logic. 

(e) (i) If r, Aft b l,^, then T h l t ^ ^ (deduction theorem) 

(ii) m => <ft => ® u Aft => ® 

(iii) \- Ll Aft^^Aft 

(iv) h Li (AS =* <ft) => (-fft ^ --.#) 

(v) h L , Aft =* {->Aft =* <ft) 

(vi) h —\—i[—i—iAft =£>■ Aft) 

(vii) 77), —'—>Aft h Lj > — 

^The principal origin of intuitionistic logic was L.E.J. Brouwer’s belief that 
classical logic is wrong. According to Brouwer, AS V 77 is proved only when a proof of 
^ or a proof of 7? has been found. As a consequence, various tautologies, such as 
Aft V —Aft, are not generally acceptable. For further information, consult Brouwer 
(1976), Heyting (1956), Kleene (1952), Troelstra (1969), and Dummett (1977). Jas- 
ltowsld (1936) showed that Li is suitable for a many-valued logic with denumerably 
many values. 




(viii) h L t ~ ' 1 ^ 

o r . K if and only if ® is a tautology. 

if and only if is a tautology. 

t> if*i has A and -> as its only connectives, then h l,^ if and only if $ 
is a tautology. 

I a Lei & and be in the relation R if and only if h < S. Show that R 

*’ equivalence relation. Given equivalence classes [38] and [ r A], let 
* * aW — V 9?\, [&]n[ < #] = [& A#], and [#] = [-«#|. Show that the 
v ilence classes under A form a Boolean algebra with respect to n, u 
id Tailed the Lindenbaum algebra L* determined by L. The element 0 of 
- is the equivalence class consisting of all contradictions (i.e., negations of 
tautologies). jLe unit element 1 of L* is the equivalence class consisting of 
aH tautologies. Notice that h \, ( JS r A if and only if [&\ < [<#] in L*, and that 
* ^ <■£ if a nd only if [$} = [#]. Show that a Boolean function / (built up 
lYoin variables , 0, and 1, using u, n and - ) is equal to the constant function 
| in all Boolean algebras if and only if h l/ # , where/# is obtained from / 
bv changing u, n, ", 0 and 1 to V, A, A\ A ->A\, and A \ V-vlj, re- 






2.1 QUANTIFIERS 

There are various kinds of logical inference that cannot be justified on the 
basis of the propositional calculus; for example: 

1 . Any friend of Martin is a friend of John. 

Peter is not John’s friend. 

Hence, Peter is not Martin’s friend. 

2. All human beings are rational. 

Some animals are human beings. 

Hence, some animals are rational. 

3. The successor of an even integer is odd. 

2 is an even integer. 

Hence, the successor of 2 is odd. 

The correctness of these inferences rests not only upon the meanings of the 

j. 

truth-functional connectives, but also upon the meaning of such expressions 
as ‘any’, ‘all’ and ‘some’, and other linguistic constructions. 

In order to make the structure of complex sentences more transparent, it 
is convenient to introduce special notation to represent frequently occurring 
expressions. If P(x) asserts that x has the property P, then (Vx)P(x) means 
that property P holds for all x or, in other words, that everything has the 
property P. On the other hand, (3x)P(x) means that some x has the property 
P - that is, that there is at least one object having the property P. In 
(Vx)P(x), ‘ (Vx) ’ is called a universal quantifier, in (3 x)P(x), ‘(3x)’ is called an 
existential quantifier. The study of quantifiers and related concepts is the 
principal subject of this chapter. 

Examples 

Y. Inference 1 above can be represented symbolically: 

(Vx)(F(x,/w) =>F(x,j)) 


E(p,m ) 
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Here, F(x,} > ) means that x is a friend of y, while m , j and p denote 
Martin, John, and Peter, respectively. 

2'. Inference 2 becomes: 

(Vx) (H (x) => /?(x)) 

(3x)(^l(x) A H(x)) 

(3x)(/l(x) Ai?(x)) 

Here, H, R and A designate the properties of being human, rational, and 
an animal, respectively. 
y Inference 3 can be symbolized as follows: 

(Vx) (/ (x) A E(x) => D(s(x))) 

1(b) A E(b) 

D(s(b)) 

Here, 1, E and D designate respectively the properties of being an integer, 
even and odd; s(x ) denotes the successor of x; and b denotes the integer 2. 

Notice that the validity of these inferences does not depend upon the 
particular meanings of F,m,j,p,H , R,A,I,E,D,s and b. 

Just as statement forms were used to indicate logical structure dependent 
upon the logical connectives, so also the form of inferences involving 
quantifiers, such as inferences 1 3, can be represented abstractly, as in 
1' — 3'. For this purpose, we shall use commas, parentheses, the symbols -> 
and of the propositional calculus, the universal quantifier symbol V, and 
the following groups of symbols: 

Individual variables: 1 5 5 * • * i %f l ) • • • 

Individual constants: a \ , 02 , . . • , a „, . . . 

Predicate letters: A'j, (n and k are any positive integers) 

Function letters: fj! (n and k are any positive integers) 

The positive integer n that is a superscript of a predicate letter A n k or of a 
function letter fj! indicates the number of arguments, whereas the subscript 
k is just an indexing number to distinguish different predicate or function 
letters with the same number of arguments .t 

In the preceding examples, x plays the role of an individual variable; 
m , /, p and b play the role of individual constants; F is a binary predicate 
letter (i.e., a predicate letter with two arguments); H,R t A,I,E and D are 
monadic predicate letters (i.e., predicate letters with one argument); and s is 
a function letter with one argument. 

The function letters applied to the variables and individual constants 
generate the terms : 

1 F 01 example, in arithmetic both addition and multiplication take two argu- 
ments. So, we would use one function letter, say fj, for addition, and a different 
function letter, say j f for multiplication. 
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1 . Variables and individual constants are terms. 

2. If f k is a function letter and are terms, then , U, • • • , t„) is 

a term. 

3. An expression is a term only if it can be shown to be a term on the basis 
of conditions 1 and 2. 


Terms correspond to what in ordinary languages are nouns and noun 
phrases - for example, Two’, Two plus three’, and Two plus x\ 

The predicate letters applied to terms yield the atomic formulas; that is, if 
A’l is a predicate letter and t\ , t 2 , . . . , t„ are terms, then A” k {t\ ,t 2 ,..., t n ) is an 
atomic formula. 

The well-formecl formulas (wfs) of quantification theory are defined as 
follows: 

1. Every atomic formula is a wf. 

2. If dS and 'A are wfs and y is a variable, then (-■$), Y), and ((Vy)$?) 

are wfs. 

3. An expression is a wf only if it can be shown to be a wf on the basis of 
conditions 1 and 2. 


In ((Vy)^), is called the scope of the quantifier ‘(Vy)’. Notice that dfi need 
not contain the variable y. In that case, we understand ((Vy)&) to mean the 
same thing as 

The expressions A ^) , (^ V ^) , and are defined as in system 

L (see page 36). It was unnecessary for us to use the symbol 3 as a primitive 
symbol because we can define existential quantification as follows: 

((Tv).'5?) stands for (-i((Vjc)(-i^))) 

This definition is faithful to the meaning of the quantifiers: &(x) is true for 
some x if and only if it is not the case that 3?(x) is false for all xd 


Parentheses 

The same conventions as made in Chapter 1 (page 20) about the omission of 
parentheses are made here, with the additional convention that quantifiers 
(Vy) and (3y) rank in strength between A, V and =y, a>. 

Examples 

Parentheses are restored in the following steps. 

1. (Vx,)^!(xi) =>A^(x 2 ,Xi) 

((Vxi)T|(xi)) ^A]{x 2 ,X\) 

(((Vxi)Tj(xi)) =^i(*2,Xl)) 

HVe could have taken 3 as primitive and then defined ((Vx)&) as an ab- 
breviation for (-i((3x)(-i.'^))), since e&(x) is true for all x if and only if it is not the 
case that 08 (x) is false for some x. 
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2. (Vxi)^|(xi) VTf(x 2 ,xi) 

(Vxi)(^|(xi) V^J(x 2 ,xi)) 

((Vxi)(v4j(xi)) V ^i(x 2 ,xi))) 

3. (Vxi)(3x 2 )Tj(xi,x 2 ) 

(Vxj )((3x 2 )ylj ! (xi , X 2 )) 

((Vx,)((3x 2 )^(x 1 ,x 2 ))) 

Exercises 

2.1 Restore parentheses to the following. 

(a) (Vxi)^J(x,) A-^j(x 2 ) 

(b) (Vx 2 )v4](x 2 ) ^J(x 2 ) 

(c) (Vx 2 )(3xi)ylf(xi,x 2 ) 

(d) (Vxj ) (Vx 3 ) (Vx4 M j (xi ) A\ (x 2 ) A -> A } (xi ) 

(e) (3 xi)(Vx 2 )(3x 3 )^] (xi) V (3x 2 )-i(Vx 3 )^(x3,x 2 ) 

(f) (Vx 2 )-iylj (xi) ^ ^j(xi,xi,x 2 ) V (Vx])/lJ(xi) 

(g) -i(Vx])^4}(x 1 ) (3x 2 )^|(x 2 ) ^ ^(xj,x 2 ) A^j(x 2 ) 

2.2 Eliminate parentheses from the following wfs as far as is possible. 

(a) (((Vxi)(ylj(xi) =^!(xi))) V ((3xi)ylj(xi))) 

(b) ((--((3x 2 )(^|(x 2 ) V^j(fli)))) <^j(x 2 )) 

(c) (((Vxi)(->(~>yl](a3)))) => (^j(xi) =^](x 2 ))) 

An occurrence of a variable x is said to be bound in a wf & if either it is 
the occurrence of x in a quantifier ‘(Vx)’ in fS or it lies within the scope of a 
quantifier ‘(Vx)’ in Otherwise, the occurrence is said to be free in £8. 

Examples 

1. yli(xi,x 2 ) 

2. ^l|(xi,x 2 ) (Vxi)ylj(xi) 

3. (Vxi)(^i(xi,x 2 ) =>- (Vxi)^|(xi)) 

4. (zbq)ylf(xi,x 2 ) 

In Example 1, the single occurrence of xj is free. In Example 2, the first 
occurrence of xi is free, but the second and third occurrences are bound. In 
Example 3, all occurrences of xi are bound, and in Example 4 both oc- 
currences of xi are bound. (Remember that (3xi)^(xi,x 2 ) is an abbrevia- 
tion of ->(Vxi)-iylf (xi,x 2 ).) In all four wfs, every occurrence of x 2 is free. 
Notice that, as in Example 2, a variable may have both free and bound 
occurrences in the same wf. Also observe that an occurrence of a variable 
may be bound in some wf but free in a subformula of For example, the 
first occurrence of xi is free in the wf of Example 2 but bound in the larger 
wf of Example 3. 

A variable is said to be free (bound) in a wf J if it has a free (bound) 
occurrence in Thus, a variable may be both free and bound in the same 
wf; for example, xi is free and bound in the wf of Example 2. 



Exercises 




2.3 Pick out the free and bound occurrences of variables in the following 
wfs. 

(a) (Vx 3 )(((Vxj)i4f (xj ,x 2 )) => Af(x 3 , a x )) 

(b) (Vx 2 )/lJ(x 3 ,x 2 ) =¥ (Vx 3 )Af(x 3 ,x 2 ) 

(C) ((Vx 2 ) (3 a i )A\ (Ai , A' 2 ,/, 2 (a, , A 2 ))) V -1 (VAi)y4j(A 2 ,/j I (xi)) 

2.4 Indicate the free and bound occurrences of all variables in the wfs of 
Exercises 2.1 and 2.2. 

2.5 Indicate the free and bound variables in the wfs of Exercises 2. 1-2.3. 

We shall often indicate that some of the variables x* , . . . , x^ are free 
variables in a wf 38 by writing 38 as 38 (x^ , . . . ,x,- ). This does not mean that 
38 contains these variables as free variables, nor does it mean that 38 does 
not contain other free variables. This notation is convenient because we can 
then agree to write as 38{t \ , . . . , tk) the result of substituting in 38 the terms 
t\ , . . . , tk for all free occurrences (if any) of x,- , . . . , x, , respectively. 

1 fc 

If 38 is a wf and t is a term, then t is -said to be free for x,- in 38 if no free 
occurrence of x,- in 38 lies within the scope of any quantifier (Vx 7 ), where xj is 
a variable in t. This concept of t being free for x/ in a wf 38(x t ) will have 
certain technical applications later on. It means that, if t is substituted for all 
free occurrences (if any) of x,- in 38(xi), no occurrence of a variable in t 
becomes a bound occurrence in 38 (t). 

Examples 

1. The term x 2 is free for xi in A J (xj ) , butx 2 is not free for xi in (Vx 2 )^4j(xi). 

2. The term f^(x\,x 3 ) is free for xj in (Vx 2 )T^(x 1 ,x 2 ) T{(xi) but is not free 

for xi in (3x 3 ) (Vx 2 )T ^(xi , x 2 ) A\(xi ). 

The following facts are obvious. 

1 . A term that contains no variables is free for any variable in any wf. 

2. A term t is free for any variable in 38 if none of the variables of t is bound 
in 38. 

3. x, is free for x,- in any wf. 

4. Any term is free for x, in 38 if 38 contains no free occurrences of x,-. 


Exercises 

2.6 Is the term /j 2 (xj,x 2 ) free for xj in the following wfs? 

(a) A\ (xi , x 2 ) =?> (Vx 2 )^4j (x 2 ) 

(b) ((Vx 2 )^f(x 2 ,fli)) V (3 x 2 )^4j ( xi,x 2 ) 

(c) ( Vx , )yl 2 (xi,x 2 ) 

(d) (Vx 2 )y4 2 (xi,x 2 ) 

(e) (Vx 2 )A j(x 2 ) => A j (x i , x 2 ) 





2.7 Justify facts 1-4 above. 

When English sentences are translated into formulas, certain general 
guidelines will be useful: 

1 A sentence of the form ‘All 4s are 5s’ becomes (Vx)(4(x) ==> B(x)). For 
example. Every mathematician loves music is translated as 
(Vx) (M (x) =yL(x)), where M(x) means x is a mathematician and L(x) 
means x loves music. 

2. A sentence of the form ‘Some As are £s’ becomes (3x)(4(x) A5(x)). For 
example. Some New Yorkers are friendly becomes (3x)(A(x) AfW), 
where N(x) means x is a New Yorker and F(x) means x is friendly. 

3. A sentence of the form ‘No As are 5s’ becomes (Vx) (A (x) -i5(x))3 For 

example, No philosopher understands politics becomes (Vx) (P(x) 

-i t/(x)), where P(x) means x is a philosopher and £/(x) means x un- 
derstands politics. 

Let us consider a more complicated example: Some people respect ev- 
eryone. This can be translated as (3x)(F(x) A (Vy)(P(y) =^F(x,y))), where 
P(x) means x is a person and R(x,y) means x respects y. 

Notice that, in informal discussions, to make formulas easier to read we 
may use lower-case letters u,v,x,y,z instead of our official notation x,- for 
individual variables, capital letters A, B, C, . . . instead of our official notation 
A’l for predicate letters, lower-case letters fg,h>... instead of our official 
notation fjl for function letters, and lower-case letters a,b,c, . . . instead of 
our official notation c,- for individual constants. 

Exercises 

2.8 Translate the following sentences into wfs. 

(a) Anyone who is persistent can learn logic. 

(b) No politician is honest. 

(c) Not all birds can fly. 

(d) All birds cannot fly. 

(e) x is transcendental only if it is irrational. 

(f) Seniors date only juniors. 

(g) If anyone can solve the problem, Hilary can. 

(h) Nobody loves a loser. 

(i) Nobody in the statistics class is smarter than everyone in the logic class. 

(j) John hates all people who do not hate themselves. 

(k) Everyone loves somebody and no one loves everybody, or somebody 
loves everybody and someone loves nobody. 

(l) You can fool some of the people all of the time, and you can fool all the 
people some of the time, but you can’t fool all the people all the time. 

+As we shall see later, this is equivalent to -i(3x)(4(x) A-fi(x)). 




QUANTIFICATION THEORY 


(m) Any sets that have the same members are equal. 

(n) Anyone who knows Julia loves her. 

(o) There is no set belonging to precisely those sets that do not belong to 
themselves. 

(p) There is no barber who shaves precisely those men who do not shave 
themselves. 

2.9 Translate the following into everyday English. Note that everyday 
English does not use variables. 

(a) (\/x)(M(x) A (Vy)-ilT(x,y) t/(x)), where M(x) means x is a man , 
W(x,y) means x is married to y, and U (x) means x is unhappy. 

(b) (Vx)(F(x) A P(x) A(x, b)), where V(x) means x is an even integer, P(x) 
means x is a prime integer, A (x, y) means x — y\ and b denotes 2. 

(c) ”»(3j)(7(y) A (Vx)(/(x) => L(x,y))), where /(y) means y is an integer and 
L(x,y) means x^y. 

(d) In the following wfs, A\(x) means x is a person and A 2 (x,y) means x 
hates y. 

(i) (3x)(zl j (x) A (Vy) (A } ( y) => A\{x,y))) 

(ii) (Vx) (A } (x) => (Vy) (A j ( y) => A\(x,y))) 

(iii) (3x)(^[(x) A (Vy) (A J ( y) => (A 2 l (x,y)yyA 2 l (v,y)))) 

(e) (\/x)(H(x) ^ (3y)(32)(-t4(y,z) A (Vi/)(P(w,x) <^> (A(u,y) VA ( u,z ))))), 
where H(x) means x is a person, A(u , v) means ‘u = v\ and P(u, x) means 
u is a parent of x. 


2.2 FIRST-ORDER LANGUAGES AND THEIR INTERPRETATIONS. 
SATISFIABILITY AND TRUTH. MODELS 


Well-formed formulas have meaning only when an interpretation is given 
for the symbols. We usually are interested in interpreting wfs whose symbols 
come from a specific language. For that reason, we shall define the notion of 
a first-order language J 


tThe adjective ‘first-order’ is used to distinguish the languages we shall study 
here from those in which there are predicates having other predicates or functions as 
arguments or in which predicate quantifiers or function quantifiers are permitted, or 
both. Most mathematical theories can be formalized within first-order languages, 
although there may be a loss of some of the intuitive content of those theories. 
Second-order languages are discussed in the appendix on second-order logic. Ex- 
amples of higher-order languages are studied also in Godel (1931), Tarski (1933), 
Church (1940), Hasenjaeger and Scholz (1961) and Van Bentham and Doets (1983). 
Differences between first-order and higher-order theories are examined in Corcoran 
(1980). 
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DEFINITION 



A first-order language contains the following symbols. 

(a) The propositional connectives -> and =>, and the universal quantifier 
symbol V. 

(b) Punctuation marks: the left parenthesis (, the right parenthesis), and the 
comma) 

(c) Denumerably many individual variables X ] , X2, — 

(d) A finite or denumerable, possibly empty, set of function letters. 

(e) A finite or denumerable, possibly empty, set of individual constants. 

(f) A non-empty set of predicate letters. 

By a term of ^ we mean a term whose symbols are symbols of ££. 

By a wf of we mean a wf whose symbols are symbols of Sd. 

Thus, in a language ££, some or all of the function letters and individual 
constants may be absent, and some (but not all) of the predicate letters may 
be absent.* The individual constants, function letters and predicate letters of 
a language S£ are called the non-logical constants of f£ . Languages are 
designed in accordance with the subject matter we wish to study. A language 
for arithmetic might contain function letters for addition and multiplication 
and a predicate letter for equality, whereas a language for geometry is likely 
to have predicate letters for equality and the notions of point and line but no 
function letters at all. 


DEFINITION 

Let ££ be a first-order language. An interpretation M of if consists of the 
following ingredients. 

(a) A non-empty set D, called the domain of the interpretation. 

(b) For each predicate letter^'! of an assignment of an o-place relation 
(^'?) M in D. 

(c) For each function letter fj of an assignment of an w-place operation 
( fj) u in D (that is, a function from U 1 into D). 

(d) For each individual constant a-, of an assignment of some fixed 
element (g,-) m of D. 

Given such an interpretation, variables are thought of as ranging over the 
set D, and -i, =*► and quantifiers are given their usual meaning. Remember 
that an n-place relation in D can be thought of as a subset of D", the set of all 

*The punctuation marks are not strictly necessary; they can be avoided by 
redefining the notions of term and wf. However, their use makes it easier to read and 
comprehend formulas. 

*If there were no predicate letters, there would be no wfs. 
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^-tuples of elements of D. For example, if D is the set of human beings, then 
the relation ‘father of’ can be identified with the set of all ordered pairs (x,y) 
such that x is the father of y. 

For a given interpretation of a language Jzf, a wf of if without free 
variables (called a closed 11 / or a sentence) represents a propostion that is 
true or false, whereas a wf with free variables may be satisfied (i.e., true) for 
some values in the domain and not satisfied (i.e., false) for the others. 


Examples 

Consider the following wfs: 

1. A\(x\,x 2 ) 

2. (Vx 2 )^i(xi,x 2 ) 

3. (3xi)(Vx 2 )T?(xi,x 2 ) 

Let us take as domain the set of all positive integers and interpret A j(y,z) as 
y ^ z. Then wf 1 represents the expression ‘xi ^x 2 ’, which is satisfied by all 
the ordered pairs ( a , b ) of positive integers such that a^b. Wf 2 represents 
the expression ‘For all positive integers x 2 , xi ^x 2 ,’l which is satisfied only 
by the integer 1. Wf 3 is a true sentence asserting that there is a smallest 
positive integer. If we were to take as domain the set of all integers, then wf 3 
would be false. 


Exercises 

2.10 For the following wfs and for the given interpretations, indicate for 
what values the wfs are satisfied (if they contain free variables) or whether 
they are true or false (if they are closed wfs). 

(i) Aj(f?(x u x 2 ),a l ) 

(ii) Af(x 1 , x 2 ) ^ A] (x 2 , x, ) 

(iii) (Vx 1 )(Vx 2 )(Vx3)(Tj(xi,x 2 ) A^(x 2 ,x 3 ) Tf(x i,x 3 )) 

(a) The domain is the set of positive integers, Aj(y,z) is y^z, fl(y,z ) is 
y • z, and a\ is 2. 

(b) The domain is the set of integers, A \ (y, z) is y — z,f^( y, z) is y + z, and 
ci is 0. 

(c) The domain is the set of all sets of integers, Aj(y,z) if y C z, 
y fi z, and a\ is the empty set 0. 

2.11 Describe in everyday English the assertions determined by the fol- 
lowing wfs and interpretations. 

(a) (Vx)(Vy)(^(x,y) ^ (3z)(A\ (z) AAf(x,z) AAj(z,y))), where the domain 
D is the set of real numbers, A f (x, y) means x < y\ and A J (z) means z is a 
rational number. 

'i'In ordinary English, one would say ‘xi is less than or equal to all positive 
integers’. 
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(b) (Vx) (A { (x) => (3}>)(A l 2 (y) Ari 2 (y,x))), where D is the set of all days and 
people, A { (x) means x is a day , A\(y) means y is a sucker, and A^(y,x) 
means y is born on day x. 

(c) (Vx) (Vy) (A } (x) A A | (y) =» A 2 {fl(x,y))), where D is the set of integers, 
A\ (x) means a is odd, A 2 (x) means x is even, and fi(x,y) denotes x + y. 

(d) For the following wfs, D is the set of all people and A\(u, v) means u 
loves v. 

(i) (3x)(Vy)(^(x,y) 

(ii) (Vy)(3x)ylf(x,y) 

(iii) (3x)(Vy)((Vz)(^JO,z) ^A\(x,y)) 

(iv) ( 3 a ) (Vy) ~<A \ (x, y) 

The concepts of satisfiability and truth are intuitively clear, but, following 
Tarski (1936), we also can provide a rigorous definition. Such a definition is 
necessary for carrying out precise proofs of many metamathematical results. 

Satisfiability will be the fundamental notion, on the basis of which the 
notion of truth will be defined. Moreover, instead of talking about the 
^-tuples of objects that satisfy a wf that has n free variables, it is much more 
convenient from a technical standpoint to deal uniformly with denumerable 
sequences. What we have in mind is that a denumerable sequence 
a = (ai,A 2,A3, . • -) is to be thought of as satisfying a wf I that has 
x ji , Xj 2 . . . , Xj n as free variables (where j\ < /2 < • • • < j„) if the n-tuple 
(Sj { , sj 2 , . . . , Sj n ) satisfies PA in the usual sense. For example, a denumerable 
sequence (ai,a 2,X3, . . .) of objects in the domain of an interpretation M will 
turn out to satisfy the wf Aj(x 2 , X5) if and only if the ordered pair, (s f 2 , £5} is in 
the relation (A^y assigned to the predicate letter A\ by the interpretation M. 

Let M be an interpretation of a language JA and let D be the domain of 
M. Let X be the set of all denumerable sequences of elements of D. For a wf 
$ of JA, we shall define what it means for a sequence s = (a 1,^2 , . . .) in X to 
satisfy PA in M. As a preliminary step, for a given a in X we shall define a 
function s* that assigns to each term t of JA an element s* ( t ) in D. 

1. If t is a variable x,-, let a*(/) be s f . 

J J -i # 

2. If t is an individual constant aj, then .?*(/) is the interpretation ( a j) of 
this constant. 

3. if ft is a function letter, (f k !) M is the corresponding operation in D, and 

are terms, then 

• • ■ An)) = {r k T{s\h ), . . . ,s*(t n )) 

Intuitively, s* (/) is the element of D obtained by substituting, for each j, a 
name of sj for all occurrences of xj in t and then performing the operations 
of the interpretation corresponding to the function letters of t. For instance, 
if t is f 2 (X3 , f\ (xj , 1 ) ) and if the interpretation has the set of integers as its 
domain, f 2 and /j 2 are interpreted as ordinary multiplication and addition, 
respectively, and a x is interpreted as 2, then, for any sequence r — (a 1? j 2 , • - ■) 
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of integers, ,y*(/) is the integer 53 ■ (si + 2). This is really nothing more than 
the ordinary way of reading mathematical expressions. 

Now we proceed to the definition of satisfaction, which will be an in- 
ductive definition. 

1. If 3ft is an atomic wf A'}.(t \ ,...,/„) and (/(") M is the corresponding 
n-place relation of the interpretation, then a sequence s — (s\,si , . - .) 
satisfies 2ft if and only if (^4") M (j*(q), . . . ,s*(t „ )) - that is. if the «-tuple 
(s*(fi), is in the relation 

2. s satisfies -> tft if and only if s does not satisfy 3ft. 

3 . j satisfies 2ft ^ if and only if s does not satisfy 3ft or s satisfies ( €. 

4. ^ satisfies {fxi)2ft if and only if every sequence that differs from s in at 
most the rth component satisfies 2ftA 

Intuitively, a sequence s = ,S 2 , . ■ ■) satisfies a wf 2ft if and only if, when, 

for each i, we replace all free occurrences of x, (if any) in 2ft by a symbol 
representing s h the resulting proposition is true under the given interpreta- 
tion. 

Now we can define the notions of truth and falsity of wfs for a given 
interpretation. 


DEFINITIONS 


1. A wf 2ft is true for the interpretation M (written |=m 2ft) if and only if every 
sequence in E satisfies 2ft. 

2. 3ft is said to be false for M if and only if no sequence in E satisfies 2ft. 

3. An interpretation M is said to be a model for a set T of wfs if and only if 
every wf in T is true for M. 

The plausibility of our definition of truth will be strengthened by the fact 
that we can derive all of the following expected properties 1-XI of the 
notions of truth, falsity and satisfaction. Proofs that are not explicitly given 
are left to the reader (or may be found in the answer to Exercise 2.12). Most 


^For example, if the domain of the interpretation is the set of real numbers, the 
interpretation of A\ is the relation ^ , and the interpretation of f{ is the function e x , 
then a sequence s — (5 1,^2, ■ ■ •) of real numbers satisfies A\{f\ (*2), *5) if and only if 
eft 2 ^35. If the domain is the set of integers, the interpretation of Ai(x,y,u,v) is 
x-v — u-y, and the interpretation of a[ is 3 , then a sequence ,v = (.S'i,i'2, • ■ ■) of 
integers satisfies ^(*3, «i ,*1 ,*3) if and only if (53) 2 — 3 i| . 

*In other words, a sequence 5 = (si,S2,...,Si ,.. .) satisfies (\/xi)2ft if and only if, 
for every element c of the domain, the sequence (.s ’ i , .s'2 , . . . , c, . . .) satisfies 3 ft. Here, 
($i , S21 - - - , c, . . .) denotes the sequence obtained from (5! , 52, . . . , s,-, . - .) by replacing 
the /th component Sj by c. Note also that, if s satisfies (Vxj) 3 ft, then, as a special case, s 
satisfies 2 ft. 
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of the results are also obvious if one wishes to use only the ordinary intuitive 
understanding of the notions of truth, falsity and satisfaction. 


(I) 

(ID 

(HD 

(IV) 

(V) f 


(VI) 


(VII) 


(a) 8ft is false for an interpretation M if and only if ->88 is true for M. 

(b) 8ft is true for M if and only if ->8ft is false for M. 

It is not the case that both |=m 8ft and |=m -* 8ft\ that is, no wf can be 
both true and false for M. 

If |= M 8ft and |=m 8ft => then |=m 8ft. 

8ft is false for M if and only if |=m 8ft and |=m -f&. 

Consider an interpretation M with domain D. 

(a) A sequence 5 satisfies 8ft A ^ if and only if ^ satisfies 8ft and j 
satisfies ( €. 

(b) s satisfies 8ft V ^ if and only if s satisfies 8ft or s satifies < A. 

(c) 5 satisfies 8ft if and only if 5 satisfies both 8ft and ^ or s 
satisfies neither 8ft nor ( €. 

(d) ■s satisfies (3xj)8ft if and only if there is a sequence s' that differs 
from s in at most the ith component such that s' satisfies 8ft. (In 
other words s = (s\,S 2 , • • . ,-sv, . • .) satisfies (3xj)8ft if and only if 
there is an element c in the domain D such that the sequence 
(^i , 52) • • • , G • • •) satisfies 8ft.) 

|=m 8ft if and only if |=m (8x8) 8ft. We can extend this result in the 
following way. By the closure * of 8ft we mean the closed wf obtained 
from 8ft by prefixing in universal quantifiers those variables, in order 
of descending subscripts, that are free in 8ft. If 8ft has no free vari- 
ables, the closure of 8ft is defined to be 8ft itself. For example, if 8ft is 
A\{x 2 ,xs) * ( 3x2) A “i (x'i , a' 2 , A 3 ) , its closure is (Vx 5 )(Vx 3 )(Vx 2 ) 

(Vxi )8ft. It follows from (VI) that a wf 8ft is true if and only if its 
closure is true. 

Every instance of a tautology is true for any interpretation. (An 
instance of a statement form is a wf obtained from the statement 


form by substituting wfs for all statement letters, with all occurrences 
of the same statement letter being replaced by the same wf. Thus, an 
instance of A\ - 1 A 2 \J A\ is A\(x 2 ) =$■ (~ , (Vxi)^{ (xj)) \I A\(x 2 ).) To 
prove (VII), show that all instances of the axioms of the system L are 
true and then use (III) and Proposition 1.14. 

(VIII) If the free variables (if any) of a wf 8ft occur in the list a,, , . . . , x ik and 
if the sequences s and V have the same components in the 
qth, ...,4-th places, then 5 satisfies 8ft if and only if s J satisfies 8ft 
[Hint: Use induction on the number of connectives and quantifiers in 
8ft. First prove this lemma: If the variables in a term t occur in the list 
a,-, , . . . , x ik , and if a and s' have the same components in the 


* Remember that 8ft A 8ft V <8,83 ^ and (3a i)8ft are abbreviations for 

~'8ft => (8ft => <if) A 8ft) and -i(Vx,-)-i^, respectively. 

+A better term for closure would be universal closure. 
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/j th, . . . ,4th places, then s*(t) = (.?')*(/). In particular, if t contains 
no variables at all, s*(t) = (s'Y(t) for any sequences .? and s'.] 

Although, by (VIII), a particular wf 38 with A free variables is essentially 
satisfied or not only by A-tuples, rather than by denumerable sequences, it is 
more convenient for a general treatment of satisfaction to deal with infinite 
rather than finite sequences. If we were to define satisfaction using finite 
sequences, conditions 3 and 4 of the definition of satisfaction would become 
much more complicated. 

Let , . . . , Xf k be k distinct variables in order of increasing subscripts. Let 
38{x ix , ... , x ik ) be a wf that has x,-, , . . . ,x ik as its only free variables. The set of 
A- tuples (b \ , ■ ■ ■ , bk) of elements of the domain D such that any sequence 
with hi , ■ • • , bk in its i\ th, . . . , 4th places, respectively, satisfies 38(x ix , . . . , x if ) 
is called the relation (or property t) of the interpretation defined by 38. Ex- 
tending our terminology, we shall say that every A- tuple (hi , . . . , b k ) in this 
relation satisfies 38(xj t , . . . , x, A ) in the interpretation M; this will be written 
|=m 38\b \ This extended notion of satisfaction corresponds to the 
original intuitive notion. 


Examples 

1. If the domain D of M is the set of human beings, A^(x,y) is interpreted 
as x is a brother of y, and A\(x,y) is interpreted as a is a parent of y, 
then the binary relation on D corresponding to the wf ,^(xi ,a' 2 ) : 
(3xf)(A*(x\,xf) AA2(xi,X2)) is the relation of unclehood. | =y[ 38\b,c] 
when and only when h is an uncle of c. 

2. If the domain is the set of positive integers, A \ is interpreted as = , f\ is 
interpreted as multiplication, and a x is interpreted as 1, then the wf 
38 {x\ ): 

-u4|(*i>«i) A ( V.r 2 ) ( ( 3 x 3 )A f ( a , , jf (x 2 , x 3 ) ) a 2 ,a'i) VA 2 i (x 2 ,o ] )) 

determines the property of being a prime number. Thus |=m 38[k] if and 
only if A is a prime number. 

(IX) If 38 is a closed wf of a language , then, for any interpretation M, 
either |=m 3d or |=m ~'3d - that is, either 38 is true for M or 3d is false 
for M. [Hint: Use (VIII).] Of course, 38 may be true for some in- 
terpretations and false for others. (As an example, consider A\ (a\). If 
M is an interpretation whose domain is the set of positive integers, A J 
is interpreted as the property of being a prime, and the interpretation 
of a x is 2, then A\(a i) is true. If we change the interpretation by 
interpreting a { as 4, then A\(a\) becomes false.) 

If 38 is not closed - that is, if 3d contains free variables 38 may be neither 
true nor false for some interpretation. For example, if 3d is A^(xi, x 2 ) and we 
consider an interpretation in which the domain is the set of integers and 


t A property is defined when k = 1 . 
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df(v,z) is interpreted as y < z, then 38 is satisfied by only those sequences 

of integers in which xi < $ 2 . Hence, 38 is neither true nor false 
for this interpretation. On the other hand, there are wfs that are not closed 
but that nevertheless are true or false for every interpretation. A simple 
example is the wf Xj(xi) V -vf{(xj), which is true for every interpretation. 




Assume t is free for Xj in 38 (xf). Then (Vx,-)£$(x,-) => 38 (t) is true for all 
interpretations. 


The proof of (X) is based upon the following lemmas. 


lemma 1 

If t and u are terms, s is a sequence in X, f results from t by replacing all 
occurrences of x,- by u, and s' results from s by replacing the /'th component 
of s by s*(u), then s*(t') = (V)*(7). [Hint: Use induction on the length of t3\ 


LEMMA 2 

Let t be free for x,- in 38 (xj). Then*. 

(a) A sequence s — (xi,^2, • • •) satisfies 38(f) if and only if the sequence s J , 
obtained from s by substituting s*(t) for s t in the /'th place, satisfies 
38 (xj). [Hint: Use induction on the number of occurrences of con- 
nectives and quantifiers in 38 (x,-), applying Lemma 1.] 

(b) If (\/xj)38(xj) is satisfied by the sequence 5 , then 38(f) also is satisfied by s. 

(XI) If 38 does not contain x,- free, then (Vx,-)(^ <$) =>- (38 =£■ (Vx,)^ 7 ) is 

true for all interpretations. 


Proof 

Assume (XI) is not correct. Then (Vx,-)(^ ^) =>• (38 =y (Vx,)^ 7 ) is not true 

for some interpretation. By condition 3 of the definition of satisfication, 
there is a sequence s such that s satisfies (Vx,-)(^ =y and .v does not satisfy 
38 =y (Vx/)^ 7 . From the latter and condition 3, s satisfies 38 and x does not 
satisfy (Vx,-)^ 7 . Hence, by condition 4, there is a sequence s', differing from s 
in at most the /'th place, such that s' does not satisfy ( 3. Since x, is free in 
neither (Vx,) (38 3? ) nor 38, and since 5 satisfies both of these wfs, it follows 

by (VIII) that s' also satisfies both (Vxj)(38 =y ^) and 38. Since s' satisfies 

^The length of an expression is the number of occurrences of symbols in the 
expression. 
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(Vx ; -)(^ => C A), il follows by condition 4 that s' satisfies & ( €. Since s' 

satisfies $ =>- ^ and condition 3 implies that / satisfies C S, which con- 
tradicts the fact that s' does not satisfy c €. Hence, (XI) is established. 


Exercises 

2.12 Verify (I) (X). 

2.13 Prove that a closed wf ^ is true for M if and only if & is satisfied by 
some sequence s in X (Remember that X is the set of denumerable sequences 
of elements in the domain of M.) 

2.14 Find the properties or relations determined by the following wfs and 
interpretation s . 

(a) [{3u)A\ (/, 2 (x, u),y)\ A [{3v)A\ (/) 2 (x, v),z)\, where the domain D is the set 
of integers, A 2 is =, and f 2 is multiplication. 

(b) Here, D is the set of non-negative integers, A 2 is =, o\ denotes 0, f 2 is 
addition, and / 2 2 is multiplication. 

(i) [(3 z){^A\{z,a x ) AAj(f 2 (x, z),y))] 

(ii) (3 y)A\{x,f%(y,y)) 

(c) (3x3)^ 2 (/j 2 (xi,x3),x 2 ), where D is the set of positive integers, A 2 is =, 
and f 2 is multiplication, 

(d) A J (a'i ) A (Vx 2 )-^(xi,x 2 ), where D is the set of all living people, A J (x) 
means .x is a man and A 2 (x,y) means x is married to y. 

(e) (i) (3x,)(3x 2 )(^f(xi,X3) A^ 2 (x 2 ,x 4 ) A^ 2 (xi,x 2 )) 

(ii) ( 3 x 3 ) (A 2 (xi , x 3 ) A A 2 (x 3 , x 2 ) ) 

where D is the set of all people, A 2 (x, y) means x is a parent of y, and 
yl 2 (x,y) means x and y are siblings. 

(f) (Vx 3 ) ( ( 3x 4 ) (3 ^ (/j 2 (x 4 , A' 3 ) , X ! ) A ( 3x 4 ) (A 2 fjf (x 4 , A' 3 ) , X 2 ) ) =>- ^J(x 3 ,fli)), 
where D is the set of positive integers, A 2 is = , f 2 is multiplication, and 
a\ denotes 1. 

(g) -^f(x 2 ,xj) A (3y){A\(y,x\) A A\{x 2i y)), 

where D is the set of all people, Aj(u, v) means u is a parent of v, and 
A 2 (u, v ) means u is a wife of v. 

2.15 For each of the following sentences and interpretations, write a 
translation into ordinary English and determine its truth or falsity. 

(a) The domain D is the set of non-negative integers, A 2 is — , f 2 is addi- 
tion, /| is multiplication, a x denotes 0 and a 2 denotes 1 . 

(i) (Vx)(3 y)(A\(xJt{y,y)) V a 2 ))) 

(ii) (Vx) (Vy)^ 2 {f 2 (x, y) , a\ ) =» A\{x,a\) VA 2 fy,a j)) 

(iii) (3yMf(/i 2 (y,y),fl 2 ) 

(b) Here, D is the set of integers, A 2 is =, and f 2 is addition. 

(i) (Vx'i ) (Vx 2 )A 2 {f 2 (x, , x 2 ) , f 2 (x 2 , A'i )) 

(ii) (Vxi ) (Vx 2 )(Vx 3 )v4 2 (/ 2 (x, , f 2 (x 2 , x 3 )) , / 2 {f\ (*1 , * 2 ) , x 3 ) ) 

(iii) (Vxi ) (Vx 2 ) ( 3 x 3 )A 2 (f 2 (x, , x 3 ) , x 2 ) 
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( c ) The wfs are the same as in part (b), but the domain is the set of positive 
integers, A\ is =, and f 2 (x, y) is x y . 

(d) The domain is the set of rational numbers, A 2 is =, A 2 is < , ft is 
multiplication,//^) is x -f 1, and o\ denotes 0. 

(i) {Ax)A 2 {f 2 (x,x),fl{fl(ai))) 

(ii) (Vx)(Vy)(^(x,y) => ( 3z)(Al(x,z ) A A\(z,y))) 

(iii) (Vx)(-v4j(x,fli) =*► {Ay)A 2 {f^(x,y),ft(a x ))) 

(e) The domain is the set of non-negative integers, A 2 (a, v ) means u ^ v, and 
A\ (u, t>, w) means u -f v = w. 

(i) (Vx) (Vy) (Vz) (A] (x, y, z) =* A\ (>' , x, z) ) 

(ii) (Vx) (Vy) (A] (x, x, y) =y A 2 (x, y ) ) 

(iii) ( Vx) ( Vy) (Aj(x, y) A\(x,x,y)) 

(iv) (3 x)(VvMi(x,TO ; ) 

(v) (3y)(Vx)/f 2 (x,y) 

(vi) (Vx) (Vy) (A 2 (x, y) & (3 z)A\(x,z,y)) 

(f) The domain is the set of natural numbers, A 2 (a, v) means u — v, 
ft (a, v) = u + v, and ft (a, v) = u ■ v 

(Vx) (3 \y) (3z)3 2 (x, ft (ft (y , y) , ft (z, z) )) 



DEFINITIONS 

A wf 3d is said to be logically valid if and only if 3d is true for every 
interpretation.! 

3d is said to be satisfiable if and only if there is an interpretation for which 
3d is satisfied by at least one sequence. 

It is obvious that 3d is logically valid if and only if -*3d is not satisfiable, 
and 3d is satisfiable if and only if -i 3d is not logically valid. 

If 3d is a closed wf, then we know that 3d is either true or false for any 
given interpretation; that is, 3d is satisfied by all sequences or by none. 
Therefore, if 3d is closed, then 3d is satisfiable if and only if 3d is true for some 
interpretation. 

A set r of wfs is said to be satisfiable if and only if there is an inter- 
pretation in which there is a sequence that satisfies every wf of F. 

It is impossible for both a wf 3d and its negation ^3d to be logically valid. 
For if 3d is true for an interpretation, then -■ 3d is false for that interpretation. 

We say that 3d is contradictory if and only if 3d is false for every inter- 
pretation, or, equivalently, if and only if ~>3d is logically valid. 

3d is said to logically imply <€ if and only if, in every interpretation, every 
sequence that satisfies 3d also satisfies ( 3. More generally, 'A is said to be a 


/he mathematician and philosopher G.W. Leibniz (1646-1716) gave a similar 
definition: 3d is logically valid if and only if 3d is true in all ‘possible worlds’. 
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logical consequence of a set T of wfs if and only if, in every interpretation, 
every sequence that satisfies every wf in T also satisfies c €. 

3ft and c 3 are said to be logically equivalent if and only if they logically 
imply each other. 

The following assertions are easy consequences of these definitions. 

1 . 3ft logically implies if and only if 3ft => <3 is logically valid. 

2. 3ft and are logically equivalent if and only if 3ft <=> <€ is logically valid. 

3. Tf 3ft logically implies and 3ft is true in a given interpretation, then so is ( 3. 

4. If 33 is a logical consequence of a set F of wfs and all wfs in F are true in a 
given interpretation, then so is 33. 


Exercise 2.16 
Prove assertions 1-4. 

Examples 

1 . Every instance of a tautology is logically valid (VII). 

2. If t is free for x in 3ft(x), then (yx)3ft(x) 3ft(t) is logically valid (X). 

3. If 3ft does not contain x free, then (Vx) ( 3ft 33) {3ft =>- (Vx) ( 6) is logi- 

cally valid (XI). 

4. 3ft is logically valid if and only if (Vyj) . . . (3y„)3ft is logically valid (VI). 

5. The wf (Vx'2)(3xi)4j(xi,X 2) =*► ( 3x i ) (Vx 2 )A j (xj , X 2 ) is not logically valid. 
As a counterexample, let the domain D be the set of integers and let 
Aj(y,z) mean y < z. Then (Vx'2)(3x 1 )4j(xi,X2) is true but (3xi)(Vx2) 
4j(xj ,x 2 ) is false. 

Exercises 

2.17 Show that the following wfs are not logically valid. 

(a) [(Vx^Hxi) (Vx!M»(x,)] => [(Vxi)(4[(xi) => 4^(xj))] 

(b) [(Vxi)(4j(xi) V4}(xi))] ^ [((Vxj)4j (xi )) V (Vx^T^xi)] 

2.18 Show that the following wfs are logically valid.! 

(a) 3ft (t) (3 Xi)3ft(xi) if t is free for x,- in 3ft(xi) 

(b) {\/xj)3ft => (3x,-)^ 

(c) (Vx,) (\/xj)3ft => (\/ Xj )(Vxj)3ft 

(d) (3xj)3ft & -.(3 Xi)^3ft 

(e) {\fx-){3ft =>- 33) ((\/xj)3ft => (Vx,-)^) 

(f) ((3 Xj) 3ft) A (Vx,)^ <£> (Vx,) (3ft A 33) 

(g) ((3xj)3ft) V (Vx,)*? => (Vx,-) (3ft v <i) 

(h) (3xi)(3xj)3ft & (3xj)(3x,)3ft 

'At this point, one can use intuitive arguments or one can use the rigorous 
definitions of satisfaction and truth, as in the argument above for (XI). Later on, we 
shall discover another method for showing logical validity. 
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(j) (3xj)(\/xj)& =s> (\/Xj)(3xj)& 

2.19 (a) If Sd is a closed wf, show that PA logically implies 'if if and only if 'if 

is true for every inteipretation for which Pd is true. 

(b) Although, by (VI), (Vx, )A j (x, ) is true whenever (x,) is true, 
find an interpretation for which 3 j (x, ) (Vx, )A } (x, ) is not 
true. (Hence, the hypothesis that Pd is a closed wf is essential 
in (a).) 

2.20 Prove that, if the free variables of Pd are y \ , . . . ,y„, then Pd is satisfiable 
if and only if (3p,) . . . (3y„)<3 is satisfiable. 

2.21 Produce counterexamples to show that the following vvfs are not 
logically valid (that is, in each case, find an inteipretation for which the wf is 
not true). 

(a) [(Vx) (V_y) (Vz) (A 2 (x , y) AA](y,z) => A](x,z)) A (Vx)-.^(x,x)] 

=> (3x)(Vp)^5(x,p) 

(b) (Vx)(3p)v4j(x,p) =>- (3 \y)A 2 (y,y) 

(c) (3x) (3y)A 2 (x, y) =P (3y)A 2 (y,y) 

(d) [(3x)3|(x) ^ (3x)Tj(x)] (Vx) (A ] (x) & A\(x)) 

(e) (3x)(3{ (x) 4(x)) ((3x)3|(x) => (3x)3^(x)) 

(f) [(Vx)(Vy)(^5(x,p) A](y,x)) A (Vx)(Vy)(Vz)(3j(x,p) A A 2 (y,z) 

=>3^(x,z))] =5> (Vx)3j(x,x) 

(g) ° (3x)(Vy)(^(x,y) A ~'A 2 l (y,x) => \A\{x,x) & A 2 (y>,y)]) 

(h) (Vx) (V» (Vz) (A] (x, x) A (A 2 (x, z) => A] (x, y) V A] (y, z))) 

=* (3p)(Vz)^(y,z) 

(i) (3x)(Vy)(3z)((3?(y,z) ^ 3?(x,z)) (3?(x,x) 3?(y,x))) 

2.22 By introducing appropriate notation, write the sentences of each of 
the following arguments as wfs and determine whether the argument is 
correct, that is, determine whether the conclusion is logically implied by the 
conjunction of the premisses 

(a) All scientists are neurotic. No vegetarians are neurotic. Therefore, no 
vegetarians are scientists. 

(b) All men are animals. Some animals are carnivorous. Therefore, some 
men are carnivorous. 

(c) Some geniuses are celibate. Some students are not celibate. Therefore, 
some students are not geniuses. 

(d) Any barber in Jonesville shaves exactly those men in Jonesville who do 
not shave themselves. Hence, there is no barber in Jonesville. 

(e) For any numbers x,y,z, if x > y and y > z, then x > z. x > x is false for 
all numbers x. Therefore, for any numbers x and y, if x > y, then it is 
not the case that y > x. 
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(f) No student in the statistics class is smarter than every student in the 
logic class. Hence, some student in the logic class is smarter than every 
student in the statistics class. 

(g) Everyone who is sane can understand mathematics. None of Hegel’s 
sons can understand mathematics. No madmen are fit to vote. Hence, 
none of Hegel’s sons is fit to vote. 

(h) For every set x, there is a set y such that the cardinality of y is greater 
than the cardinality of x. If x is included in y, the cardinality of x is not 
greater than the cardinality of y. Every set is included in V. Hence, V is 
not a set. 

(i) For all positive integers x,x^x. For all positive integers x,y, z, if x^y 
and y^z, then x<z. For all positive integers x and y, x^y or y^x. 
Therefore, there is a positive integer y such that, for all positive integers 
x,y <x. 

(j) For any integers x,y, z, if x > y and y > z, then x > z. x > x is false for 
all integers x. Therefore, for any integers x and y, if x > y, then it is not 
the case that y > x. 

2.23 Determine whether the following sets of wfs are compatible - that is, 

whether their conjunction is satisfiable. 

(a) (3x)(3y)Tj(x,y) 

(Vx)(Vy)(3z)(Tj(x, z) AA^(z,y)) 

(b) (Vx)(3y)Tj(y,x) 

(Vx) (Vy) (Aj(x, y) => ^A\{y,x)) 

( Vx) (Vy) (Vz ) (y4 j (x, y ) A^f(y,z) ^Tj(x,z)) 

(c) All unicorns are animals. 

No unicorns are animals. 

2.24 Determine whether the following wfs are logically valid. 

(a) ->(3y) (Vx) (A\ (x, y) ^ -Aj(x, x)) 

(b) [(3x)/l J (x) => (3 x)A\ (x)] ^ (3x) (A \ (x) =i> A\ (x)) 

(c) ( 3x) (A | (x) (Vy)A | (y ) ) 

(d) (Vx) (Al(x)VA^ (x) ) ^ ( ( (Vx)T J (x) ) V ( 3 x)A l 2 (x) ) 

(e) (3x)(3y)(Tj(x,y) => (Vz)3?(z,y)) 

(f) (3x)(3y)(T|(x) =>- A ] 2 (y)) => (3x)(T j (x) => A\{x)) 

(g) (Vx) (A J (x) =>A\(x)) => - | (Vx) (A } (x) => —iA 2 (x) ) 

(h) (3x)Tf(x,x) => (3x)(3 \y)A\(x,y) 

2.25 Exhibit a logically valid wf that is not an instance of a tautology. 

However, show that any logically valid open wf (that is, a wf without 

quantifiers) must be an instance of a tautology. 

2.26 (a) Find a satisfiable closed wf that is not true in any interpretation 

whose domain has only one member. 

(b) Find a satisfiable closed wf that is not true in any interpretation 
whose domain has fewer than three members. 
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2.3 FIRST-ORDER THEORIES 

fn the case of the propositional calculus, the method of truth tables provides 
an effective test as to whether any given statement form is a tautology. 
However, there does not seem to be any effective process for determining 
whether a given wf is logically valid, since, in general, one has to check the 
truth of a wf for interpretations with arbitrarily large finite or infinite do- 
mains. In fact, we shall see later that, according to a plausible definition of 
‘effective’, it may actually be proved that there is no effective way to test for 
logical validity. The axiomatic method, which was a luxury in the study of 
the propositional calculus, thus appears to be a necessity in the study of wfs 
involving quantifiers, t and we therefore turn now to the consideration of 
hrst-order theories. 

Let Jz? be a first-order language. A first-order theory in the language Jz? will 
be a formal theory K whose symbols and wfs are the symbols and wfs of ££ 
and whose axioms and rules of inference are specified in the following way.* 

The axioms of K are divided into two classes: the logical axioms and the 
proper (or non-logical) axioms. 


LOGICAL AXIOMS 

If 38, ^ and 9 are wfs of Jz? , then the following are logical axioms of K: 

(Al) 38 =» (tf =* 38) 

(A2) {38 =y {f£ =* 3)) =» {{38 => <#) ^ [38 =* 9)) 

(A3) (-V8? => ->38) =* {{-f€ =* 38) =» <g) 

(A4) (Vxj)38(xj) =>• 38{t ) if 38{x ( ) is a wf of and t is a term of S3 that is 

free for x,- in 38{xj). Note here that t may be identical with x, so that 
all wfs (f/xj)38 =>- 38 are axioms by virtue of axiom (A4). 

(A5) (Vxj)(38 => <&) =>■ {38 => (Vx,-)#) if 38 contains no free occurrences of 

Xj. 

'There is still another reason for a formal axiomatic approach. Concepts and 
propositions that involve the notion of interpretation and related ideas such as truth 
and model are often called semantical to distinguish them from syntactical concepts, 
which refer to simple relations among symbols and expressions of precise formal 
languages. Since semantical notions are set-theoretic in character, and since set 
theory, because of the paradoxes, is considered a rather shaky foundation for the 
study of mathematical logic, many logicians consider a syntactical approach, con- 
sisting of a study of formal axiomatic theories using only rather weak number- 
theoretic methods, to be much safer. For further discussions, see the pioneering study 
on semantics by Tarski (1936), as well as Kleene (1952), Church (1956) and Hilbert 
and Bernays (1934). 

*The reader might wish to review the definition of formal theory in Section 1.4. 
We shall use the terminology (proof, theorem, consequence, axiomatic, h 38 etc.) and 
notation (T F 38, F 38) introduced there. 
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These cannot be specified, since they vary from theory to theory. A first- 
order theory in which there are no proper axioms is called a first-order 
predicate calculus. 


RULES OF INFERENCE 

The rules of inference of any first-order theory are: 

1. Modus ponens: ^ follows from 88 and 88 =y < €. 

2. Generalization: (\/xf)88 follows from 88. 

We shall use the abbreviations MP and Gen, respectively, to indicate ap- 
plications of these rules. 


DEFINITION 

Let K be a first-order theory in the language 8£. By a model of K we mean 
an interpretation of for which all the axioms of K are true. 

By (III) and (VI) on page 61, if the rules of modus ponens and gen- 
eralization are applied to wfs that are true for a given interpretation, then 
the results of these applications are also true. Hence every theorem of K is 
true in every model of K. 

As we shall see, the logical axioms are so designed that the logical con- 
sequences (in the sense defined on pages 65-6) of the closures of the axioms 
of K are precisely the theorems of K. In particular, if tC is a first-order 
predicate calculus, it turns out that the theorems of K are just those wfs of K 
that are logically valid. 

Some explanation is needed for the restrictions in axiom schemas (A4) 
and (A5). In the case of (A4), if t were not free for x, in 88 (xf, the following 
unpleasant result would arise: let 88{x\) be ~'(fix 2 )A 2 x {x\ ,xf) and let t be x 2 . 
Notice that t is not free for X] in 88{x\). Consider the following pseudo- 
instance of axiom (A4): 

(V) ( Vx 1 ) ( “•( Vx 2 )^ (x 1 , X‘ 2 ) ) => ->(Vx 2 )^(x 2 ,x 2 ) 

Now take as interpretation any domain with at least two members and let 
A j stand for the identity relation. Then the antecedent of (V) is true and the 
consequent false. Thus, (V) is false for this interpretation. 

In the case of axiom (A5), relaxation of the restriction that x, not be free 
in 88 would lead to the following disaster. Let 88 and both be A { (xi ). Thus, 
xi is free in 88. Consider the following pseudo-instance of axiom (A5): 


(VV) (Vx0(^!(x,) =»4(x0) =» (4(xj) =» (VxQ^Kxi)) 
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'I’he antecedent of (VV) is logically valid. Now take as domain the set of 
integers and let A J (x) mean that x is even. Then (Vxi )A j (xi) is false. So, any 
sequence 5 = (.V1.X2, . . .) for which xi is even does not satisfy the consequent 
of (VV)d Hence, (VV) is not true for this interpretation. 
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Examples of first-order theories 

1 . Partial order. Let the language have a single predicate letter A\ and no 
function letters and individual constants. We shall write x, <xj instead of 
A 2 (x/ , Xy ) . The theory K has two proper axioms. 

(a) (Vxi)(-ixi < xi ) (irreflexivity) 

(b) (Vxi)(Vx 2 )(Vx3)(xi < x'2 A x 2 < X 3 =^xi < X 3 ) (transitivity) 


A model of the theory is called a partially ordered structure. 


2 . Group theory. Let the language have one predicate letter A I 2 , one 
function letter ffi and one individual constant a\. To conform with or- 
dinary notation, we shall write t -- s instead of A^(t, s),t-j- s instead of 
fi 2 (t, s), and 0 instead of a\. The proper axioms of K are: 


(a) ( Vx 1 ) ( Vx 2 ) ( VX3 ) (xi -f (x 2 -f X3) = (xj -f x 2 ) -f x 3 ) 

(b) (Vxi)( 0 -f xi =xi) 

(c) (Vxi ) (3x 2 )(x 2 + xi = 0 ) 

(d) (Vxi)(x! = xj) 

(e) (Vxi ) (Vx 2 ) (xi = x 2 = 3 - x'2 = xi ) 

(f) (Vxi )(Vx 2 )(Vx3)(xi = x 2 A x 2 = x 3 xi = X3) 

(g) (Vxi)(Vx 2 )(Vx3)(x 2 =X3 => 

■^'1 + X 2 = X'l -f X '3 A X 2 -f X! = X3 -f X'l ) 


(associativity) 
(identity) 
(inverse) 
(reflexivity of =) 
(symmetry of = ) 
(transitivity of = ) 

(substitutivity of =) 


A model for this theory, in which the interpretation of = is the identity 
relation, is called a group. A group is said to be abelian if, in addition, the wf 
(Vxi)(Vx 2 )(xi -fx 2 — x 2 + xj) is true. 

The theories of partial order and of groups are both axiomatic. In gen- 
eral, any theoiy with a finite number of proper axioms is axiomatic, since it 
is obvious that one can effectively decide whether any given wf is a logical 
axiom. 


2.4 PROPERTIES OF FIRST-ORDER THEORIES 

All the results in this section refer to an arbitrary first-order theory K. 
Instead of writing t-K we shall sometimes simply write h 3ft. Moreover, 
we shall refer to first-order theories simply as theories, unless something is 
said to the contrary. 

I Such a sequence would satisfy / 4 }(xi), since $i is even, but would not satisfy 

(Vxi)d}(xi), since no sequence satisfies (Vxi)d}(xi). 



PROPOSITION 2.1 




Every wf 38 of K that is an instance of a tautology is a theorem of K, and it 
may be proved using only axioms (A1) (A3) and MP. 

Proof 

38 arises from a tautology 3T by substitution. By Proposition 1. 14, there is a 
proof of 3E in L. In such a proof, make the same substitution of wfs of K for 
statement letters as were used in obtaining $ from and, for all statement 
letters in the proof that do not occur in substitute an arbitrary wf of K. 
Then the resulting sequence of wfs is a proof of 38, and this proof uses only 
axiom schemes (Al) (A3) and MP. 

The application of Proposition 2.1 in a proof will be indicated by writing 
‘Tautology’. 


PROPOSITION 2.2 

Every theorem of a first-order predicate calculus is logically valid. 


Proof 

Axioms (A1)-(A3) are logically valid by property (VII) of the notion of 
truth (see page 61), and axioms (A4) and (A5) are logically valid by prop- 
erties (X) and (XI). By properties (III) and (VI), the rules of inference MP 
and Gen preserve logical validity. Hence, every theorem' of a predicate 
calculus is logically valid. 

Example 

The wf (Vx'2)(3xi)/4i(xi,X2) ==> ( 3x\ ) (Vx 2 )A ^ (xi , * 2 ) is not a theorem of any 
first-order predicate calculus, since it is not logically valid (by Example 5, 

p. 66). 

DEFINITION 

A theory K is consistent if no wf 38 and its negation ->38 are both provable in 
K. A theory is inconsistent if it is not consistent. 


COROLLARY 2.3 


Any first-order predicate calculus is consistent. 
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proof 

If a wf 38 and its negation ~>38 were both theorems of a first-order predicate 
calculus, then, by Proposition 2.2, both 38 and -^38 would be logically valid, 
which is impossible. 

Notice that, in an inconsistent theory K, every wf of K is provable in 
K. In fact, assume that 38 and -i 38 are both provable in K. Since the wf 
$ (p38 =4- *$) is an instance of a tautology, that wf is, by Proposition 2. 1 , 

provable in K. Then two applications of MP would yield h c 3. 

It follows from this remark that, if some wf of a theory K is not a 
theorem of K, then K is consistent. 

The deduction theorem (Proposition 1 .9) for the propositional calculus 
cannot be carried over without modification to first-order theories. For 
example, for any wf 38, 38 Hk (Vx,-)^?, but it is not always the case that 
p K 38 => (3x3)38. Consider a domain containing at least two elements c and 
d. Let K be a predicate calculus and let 38 be A } (xi ) . Interpret A\ as a 
property that holds only for c. Then A\(x\) is satisfied by any sequence 
s = • • .) in which Si = c , but (Vxi)yfi(xi) is satisfied by no sequence at 

all. Hence, yf](xi) (xi) is not true in this interpretation, and so it is 

not logically valid. Therefore, by Proposition 2.2, A \ (xi ) => (Vxi)/fJ (xi) is 
not a theorem of K. 

A modified, but still useful, form of the deduction theorem may be de- 
rived, however. Let 38 be a wf in a set T of wfs and assume that we are given 
a deduction 33 \, . . . , 33 n from T, together with justification for each step in 
the deduction. We shall say that 38 i depends upon 38 in this proof if and 
only if: 


(1) B i is 38 and the justification for 33 \ is that it belongs to T, or 

(2) 33 i is justified as a direct consequence by MP or Gen of some preceding 
wfs of the sequence, where at least one of these preceding wfs depends 
upon 38. 


Example 

38, (Vxi )38^<3 3 (Vx {fg 

(S>i) 38 Hyp 

(@2) (Vxi )38 (^1), Gen 

(B 3 ) (Vxi )38 => Hyp 

(&a) « (®2),(®3),MP 

(&s) (Vxi ( 0 4 ), Gen 

Here, (33\) depends upon 38, {33 2 ) depends upon 38, (B 3 ) depends upon 
(Vx\)38 =>- ( €, {33 4 ) depends upon 38 and (\fx\)38 =4* ( 3, and (38 5 ) depends 
upon 38 and (Vxi )38 =4- c €. 
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PROPOSITION 2.4 

If ^ does not depend upon ^ in a deduction showing that Y,3ftY then 

rv<€. 

Proof 

Let Si • • • , S„ be a deduction of ^ from T and 3ft ^ in which ^ does not 
depend upon 3ft. (In this deduction, S„ is #.) As an inductive hypothesis, let 
us assume that the proposition is true for all deductions of length less than n. 
If # belongs to F or is an axiom, then T h c 6. If ^ is a direct consequence of 
one or two preceding wfs by Gen or MP, then, since ^ does not depend 
upon 3ft, neither do these preceding wfs. By the inductive hypothesis, these 
preceding wfs are deducible from T alone. Consequently, so is ( €. 

PROPOSITION 2.5 (DEDUCTION THEOREM) 

Assume that, in some deduction showing that T, 3ft h no application of 
Gen to a wf that depends upon 3ft has as its quantified variable a free 
variable of 3ft. The T I- 3ft => c €. 

Proof 

Let Si, . . . , S„ be a deduction of from T and 3ft, satisfying the assumption 
of our proposition. (In this deduction, S„ is ( €.) Let us show by induction 
that S,- for each i^n. If S,- is an axiom or belongs to T, then 

r h 3ft => S'/, since S/ => (33 => S,-) is an axiom. If S,- is 3ft, then 
r h 3ft => S,-, since, by Proposition 2.1, h 3ft => 33: If there exist j and k less 
than i such that S* is Sy => S,-, then, by inductive hypothesis, T h J 1 => Sy 
and T h ^ (Sy =» S z ). Now, by axiom (A2), h (^ (Sy =4> S,-)) 
((3ft =4> Sy) => =>■ S/)). Hence, by MP twice, T h 3ft ==> S z . Finally, 

suppose that there is some j < i such that S/ is (VxQSy. By the inductive 
hypothesis, rh,f => Sy, and, by the hypothesis of the theorem, either Sy 
does not depend upon 3ft or Xk is not a free variable of 3ft. If Sy does not 
depend upon 3%, then, by Proposition 2.4, T h Sy and, consequently, by 
Gen, T h (Vx'/.)Sy. Thus, T f- S,-. Now, by axiom (Al), h S/ => {3ft =4> S,-). 
So, S,- by MP. If, on the other hand, x* is not a free variable of 

then, by axiom (A5), h (\fxk)(3ft => Sy) => => (Vx*)Sy). Since Y \r 3ft 

Sy, we have, by Gen, T h (Vx*)(^ =s> Sy), and so, by MP, Y h 3ft => (Vx*)Sy; 
that is, rh^=4- S/. This completes the induction, and our proposition is 
just the special case i = n. 

The hypothesis of Proposition 2.5 is rather cumbersome; the following 
weaker corollaries often prove to be more useful. 





PROPERTIES OF FIRST-ORDER THEORIES 

COROLLARY 2.6 

If a deduction showing that T, $ h ^ involves no application of Gen of 
which the quantified variables is free in then T h $ ==> ( 6. 

COROLLARY 2.7 

If $ is a closed wf and T, $ h then T V $ 

EXTENSION OF PROPOSITIONS 2.4~2.7 

In Propositions 2.4—2. 7, the following additional conclusion can be drawn 
from the proofs. The new proof of T h $ => <€ (in Proposition 2.4, of T h 
involves an application of Gen to a wf depending upon a wf $ of T only if 
there is an application of Gen in the given proof of T, $ h ^ that involves 
the same quantified variable and is applied to a wf that depends upon S. ( In 
the proof of Proposition 2.5, one should observe that Q>j depends upon a 
premiss $ of T in the original proof if and only if $ => S)j depends upon $ 
in the new proof.) 

This supplementary conclusion will be useful when we wish to apply the 
deduction theorem several times in a row to a given deduction - for ex- 
ample, to obtain T h Q) => => #) from T, &>, h from now on, it is to 

be considered an integral part of the statements of Propositions 2.4— 2.7. 

Example 

h (Vxi)(Vx 2 )^ ( Vx 2 ) ( Yx i ) 3$ 

Proof 


1. (Vxi)(Vx 2 )Y? Hyp 

2. (Vx i)(Vx 2 )^ (Vx 2 )^ (A4) 

3. (Vx 2 )^ 1, 2, MP 

4. (Vx 2 )^ ® (A4) 

5. @ 3, 4, MP 

6. (Vx \)$ 5, Gen 

7. (Vx 2 )(Vxi)^ 6, Gen 


Thus, by 1-7, we have (Vxi)(Vx 2 ).# h (Vx' 2 )(Vxi)^, where, in the deduction, 
no application of Gen has as a quantified variable a free variable of 
(Vxi)(Vx 2 )^. Hence, by Corollary 2.6, h (Vxi)(Vx 2 )^ => (Vx' 2 )(Vxi)^. 

Exercises 


2.27 Derive the following theorems. 

(a) h (Vx)(^ <€) ((Vx)r^ => (Vx)7?) 
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(b) h (Vx)(# => <€) =» ((3x)# => (3x)^) 

(c) h (Vx)($ A ^ ((Vx)^) A (Vx)# 

(d) F (tyi) . . . (V*)# =» 0 

(e) h -.(Vx)^ =4> (3x)-«# 

2.28 D Let K be a first-order theory and let K# be an axiomatic theory 
having the following axioms: 

(a) (Vyi) . . . (Vj„)^?, where $ is any axiom of K and y\, . . . ,y„(n^ 0) are 
any variables (none at all when n — 0); 

(b) (Vgi) . . . (V y„)(<% =► tf) ^ [(V 7l ) . . . (V*)# =» (Yv,) . . . (Vy„)^] where a 
and ^ are any wfs and yi ... ,y„ are any variables. 

Moreover, K # has modus ponens as its only rule of inference. Show that K# 
has the same theorems as K. Thus, at the expense of adding more axioms, 
the generalization rule can be dispensed with. 

2.29 Carry out the proof of the Extension of Propositions 2.4-2. 7 above. 


2.5 ADDITIONAL METATHEOREMS AND DERIVED RULES 


For the sake of smoothness in working with particular theories later, we 
shall introduce various techniques for constructing proofs. In this section it 
is assumed that we are dealing with an arbitray theory K. 

Often one wants to obtain ${t) from (Vx)^(x), where ns a term free for x 
in 0(x). This is allowed by the following derived rule. 


PARTICULARIZATION RULE A4 

If t is free for x in $ (x ) , then (Vx)^(x) I- &(t). 1 


Proof 

From (Vx)^(x) and the instance (Vx)^(x) => ${t) of axiom (A4), we obtain 
dd(t) by modus ponens. 

Since x is free for x in 3fl{x), a special case of rule A4 is: (Vx),4? I- 
There is another very useful derived rule, which is essentially the con- 
trapositive of rule A4. 


tFrom a strict point of view, (Vjc)^(jc) F 3fl{t) states a fact about derivability. 
Rule A4 should be taken to mean that, if (Vx)£B(x) occurs as a step in a proof, we 
may write i%(t) as a later step (if t is free for x in @{x)). As in this case, we shall often 
state a derived rule in the form of the corresponding derivability result that justifies 
the rule. 
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EXISTENTIAL RULE E4 

Let t be a term that is free for x in a wf 83{x,t), and let 83{t,t) arise from 
g${x, t) by replacing all free occurrences of x by t. {83{x, t ) may or may not 
contain occurrences of /.) Then, 83{t, t) \~ (3 x)83{x,t). 

Proof 

It suffices to show that I- 83{t, t) =>■ (3x)83{x, t). But, by axiom (A4), 
h (\/x)->8?{x,t) => ->83{t, t). Hence, by the tautology ( A => -i B) => (B => ->A) 
and MP, h 83{t,t) =4> ->(Vx )->83{x,t), which, in abbreviated form, is I- 83{t,t) 

=> {3x)83{x, t ). 

A special case of rule E4 is 83{t) \- {3x)83{x), whenever t is free for x in 
83 {x). In particular, when t is x itself, 83 (x) h (3 x)83{x). 

Example 

h (Vx) 83 => (3 x)83 

1. (Vx)J 1 Hyp 

2. 83 1, rule A4 

3. (3 x)83 2, rule E4 

4. (\/x)83 h (3 x)83 1-3 

5. h (Vx)83 => {3x)83 1-4, Corollary 2.6 

The following derived rules are extremely useful. 

Negation elimination: ->->83 I- 83 
Negation introduction: 83 h ->->83 
Conjunction elimination: 83 A f 8 V- 83 

83 h<8\- <8 

->{83 A c 8) I — >83 V —88 
Conjunction introduction: 83, <8 I- 83 A ( 8 
Disjunction elimination: 83 \J ( 8, ->83 I- <8 

83 V ( 8, -88 V 83 
->{83 V <8) h ->83 a -88) 

8ft => 9, f 8 => 9, 83 V f 8 h 9 
Disjunction introduction: 83 \- 83\j<8 

<8\~83\l<8 

Conditional elimination: 83 => c 8, —88 I — >83 

83 => —88, ( 8 1 — >83 
->83 => <8, -88 h 83 
->83 =» -8$, <8 h # 

—>{83 => E ^ 

Conditional introduction: -i^ I — 1 (^ => 

Conditional contrapositive: ^ ^ I — =4> -■,# 
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-f€ =» -T0& h # => 

Biconditional elimination: $ ^ I- ^ -■# I — 

$ & «?, <6 h ^ r ^, -.tf F -.# 

# <* <2? V # =» ^ h ^ 

Biconditional introduction: 

Biconditional negation: ^ # I — i# 4 => 

-# & -f€ h # ( 4 

Proof by contradiction: If a proof of T, ->$ I- ( € A -f€ involves no applica- 
tion of Gen using a variable free in then Th (Similarly, one obtains 
r I — ■# from r,^h^A 


Exercises 

2.30 Justify the derived rules listed above. 

2.31 Prove the following. 

(a) h (Vx)(V>M?(x,t) => (Vx)4?(x,x) 

(b) F [(Vx)#] V [(Vx)tf] =» (Vx) (# V <£) 

(c) F --(3x)# =» (Vx)--# 

(d) F (Vx)# => (Vx) (# V 

(e) h (Vx)(Vy)(Af(x,y) => ^(>\x)) => (Vx)^(x,x) 

(f) h [(3x)# => (Vx)^] => (Vx) (# => <#) 

(g) F (Vx)(# V <€) =» [(Vx)#] V (3x)tf 

(h) h (Vx)(/4j(x,x) => (3j;)Tf(x,>’)) 

(i) F (Vx) (# =* ( €) => [(Vx) =» (Vx) -i#] 

(j) i- hy) U 1 0>) =* (YvM J 6’)] 

(k) F (Vx)# => (Vx)(# V r A) 

(l) F [(Vx)(Vy)(#(x,j) => #Q>,x)) A (Vx) (Vy) (Vz) (# (x, j) A 

t$(y,z) =y ^(x,z))] (Vx)(Vy)(#(x,_y) => #(x,x)). 

2.32 Assume that # and ^ are wfs and that x is not free in #. Prove the 
following. 

(a) I- # =y (Vx)# 

(b) F # =* (3x)# 

(c) h (# =» (Vx)#) (Vx) (# #) 

(d) F ((3x)# (Vx) (# =» #) 

We need a derived rule that will allow us to replace a part # of a wf # by 
a wf that is provably equivalent to ( €. For this purpose, we first must prove 
the following auxiliary result. 


LEMMA 2.8 


For any wfs # and I- (Vx) (# ( £) ((Vx)# (Vx)#). 



ADDITIONAL METATHEOREMS AND DERIVED RULES 



proof 


1. 

2 . 

3. 

4. 

5. 

6 . 

7. 

8 . 
9. 

10 . 

11 . 


(Vx) (PS 44 ( &) 

(Vx)# 

# 44 <£ 

PS 

(Vx0 

(Vx) (PS 44 #), (Vx)# E (Vx)tf 
(Vx) (PS 44* tf) E (Vx)# =4 (Vx)# 

(Vx) (# 44 #) E (Vx) r <? =4 (Vx)# 

(Vx) (# 44* #) E (Vx)# 44 (Vx)# 
h (Vx) (# 44* #) =4* ((Vx)# 44 (Vx)#) 


Hyp 

Hyp 

1, rule A4 

2, rule A4 

3, 4, biconditional elimination 
5, Gen 

1-6 

1-7, Corollary 2.6 

Proof like that of 8 

8, 9, Bioconditional introduction 

1—10, Corollary 2.6 


PROPOSITION 2.9 

If # is a subformula of #, 0 is the result of replacing zero or more oc- 
currences of # in # by a wf <3, and every free variable of # or P3 that is also 
a bound variable of # occurs in the list yi, . .. ,yt, then: 

(a) E [(Yvi) . . . (Vy k )0 ^ ®)] =4* (PS 44 0) (Equivalence theorem) 

(b) If E ^ 44 S', then E # 44* 0 (Replacement theorem) 

(c) If E # 44* S and E #, then E 0 


Example 

(a) h (V*)(/Ij(*) » (A\{x)) =* [(344|(x) ^ (3x) 4(x) } 


Proof 

(a) We use induction on the number of connectives and quantifiers in PS. 
Note that, if zero occurrences are replaced, 0 is # and the wf to be proved 
is an instance of the tautology A =4* (B 44 B). Note also that, if # is identical 
with PA and this occurrence of # is replaced by S, the wf to be proved, 
[(Yyi) . . . (0yk)(0 44* S)] =4 (PS 44 0), is derivable by Exercise 2.27(d). 
Thus, we may assume that # is a proper part of PS and that at least one 
occurrence of # is replaced. Our inductive hypothesis is that the result holds 
for all wfs with fewer connectives and quantifiers than PS. 

Case 1. PS is an atomic wf. Then # cannot be a proper part of PS. 

Case 2. PS is ->S\ Let 0 be . By inductive hypothesis, I- [(Vyi) . . . (Vy*) 
{0 44* <3)] =4 (S 44 S 1 '). Hence, by a suitable instance of the tautology 
(C =4* (A 44* R)) =4* (C =4 (-i A 44* ->£)) and MP, we obtain E [(Vyi) . . . (Vyt) 
0 44* 2#)] =4> (# 44 0). 
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Case J. 83 is $ =j> . Let 0 be S' => «5F'. By inductive hypothesis 

I- [(Vw). . - 06*) ^ 0)] => (S ^ O and b [(Vy,) . . . (Vjft)(<«r ^ 9)\ => (j^ 
^ 0). Using a suitable instance of the tautology 

(A C)) A (A^{D& E)) => (A ^ \{B =>- D) & (C =^E)]) 

we obtain b [(Vyi) . . . 06ft) (# ^ ^)] => (83 <& 0). 

Case 4. ik is (Vx)<f. Let 0 be ( Vx) 6 J . By inductive hypothesis, 
b [(Vyi) ■ ■ - 06*) 0*? ^ ^)] =>{$<& S'). Now, x does not occur free i n 
(Vyi) . . . 06ft) ^ because, if it did, it would be free in or 9 and, 

since it is bound in 83, it would be one ofyi, ■ . ■ ,yk and it would not be free 
in (Vyi) . . . (V>ft) <& 9). Hence, using axiom (A5), we obtain b (Vyi) . . . 
(V>ft) 9) => (Vx) (S S'). However, by Lemma 2.8, b (Vx) (S s') 
((Vx)^ ^ (Vx)^). Then, by a suitable tautology and MP, b [(Vy t ) 

• - * (Vjft)(^ ^ 9)] =>(&<=> 0). 

(b) From V <8 <=$ 9, by several applications of Gen, we obtain 

b (Vyi) ■ • ■ 06ft) <=* $})' Then, by (a) and MP, b 88 & 0 . 

(c) Use part (b) and biconditional elimination. 


Exercises 

2.33 Prove the following: 

(a) b (3x)->83 <& ~>(\/x)& 

(b) b Qfa)& <* -.(3*)-.^ 

(c) b (3x)(^ =» ^ V 9)) =* (3x)(83 -itf A ->9) 

(d) b (Vx)(3y)(<0 ( €) & (Vx)(3>-)(-^ V r 8) 

(e) b (Vx)(^ => -fi) -i (3x ) {44 A <£) 

2.34 Show by a counterexample that we cannot omit the quantifiers 
(Vyi) . . . (V>ft) i n Proposition 2.9(a). 

2.35 If <8 is obtained from 83 by erasing all quantifiers (Vx) or (3x) whose 
scope does not contain x free, prove that b 88 & ( 8. 

2.36 For each wf 8ft below, find awf^ such that b <8 <=$■ ~>83 and negation 
signs in ^ apply only to atomic wfs. 

(a) (Vx)(V>)(3z)/4j(x,y,z) 

(b) (Vc)(f > 0 =4- (3<5)(<5 > 0 A (Vx)(|x — cj < <5 |/(x) —/(c) I < e )) 

(c) (Ve)(e > 0 (3n)(\f?n)(m > n ==> \a m — b\ < e)) 

2.37 Let 83 be a wf that does not contain => and <£>. Exchange universal 
and existential quantifiers and exchange A and V. The result 0 is called the 
dual of 88. 

(a) In any predicate calculus, prove the following. 

(i) b 88 if and only if I — >88* 

(ii) b 88 => <€ if and only if b <e 83*. 

(iii) b 88 «=> <8 if and only if b 88* 

(iv) b {3x)(83 8 ( 8) <^> [((3x)83) V (3x)^]. [Hint: Use Exercise 2.27(c).] 
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*(b) Show that the duality results of part (a), (i) (iii), do not hold for ar- 
bitrary theories. 
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2.6 RULE C 


It is very common in mathematics to reason in the following way. Assume 
that, we have proved a wf of the form (3x)${x). Then we say, let b be an 
object such that t%(b). We continue the proof, finally arriving at a formula 
that does not involve the arbitrarily chosen element b. 

For example, let us say that we wish to show that 
(3x)(^?(x) => #(*)), (Vx)^(x) I- (3x)^(x). 


1. (3x)(#(x) =* ^(x)) Hyp 

2. (Vx)^(x) Hyp 

3 g${b) ==> %>(b) for some b 1 

4. (%\b) 2, rule A4 

5. <${b) 3, 4, MP 

6. (3x)^(x) 5, rule E4 


Such a proof seems to be perfectly legitimate on an intuitive basis. In fact, 
we can achieve the same result without making an arbitrary choice of an 
element b as in step 3. This can be done as follows: 


1. (Vx)^(x) 

2. (Vx)->^(x) 

3. h(x) 

4. -M{x) 

5. -i(&(x) =» V(x)) 

6. (Vx)-i(^(x) => ^(x)) 

7. (Vx)^(x), (Vx)-^(x) h 
(Vx)-<(^(x) => r <^(x)) 

8. (Vx)^?(x) h (Vx)-i^’(x) 

=4- (Vx)->(.^(x) => (x)) 

9. (Vx)^(x) I — i(Vx)-'(^(x) 
=» tf(x)) =* -i(Vx)-i^(x) 

10. (Vx)^(x) h (3x)(#(x) =» 
^(x)) => (3v)^(x) 

11. (3x)(^(x) =» ^(x)), 
(Vx)^(x) h (3x)^(x) 


Hyp 

Hyp 

1, rule A4 

2, rule A4 

3, 4, conditional introduction 
5, Gen 

1 6 

l 7, corollary 2.6 
8. contrapositive 
Abbreviation of 9 
10, MP 


In general, any wf that can be proved using a finite number of arbitrary 
choices can also be proved without such acts of choice. We shall call the rule 
that permits us to go from (ELx)^(x) to 3S{b), rule C (‘C’ for ‘choice’). More 
precisely, a rule C deduction in a first-order theory K is defined in the 
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following manner: F he ^ if and only if there is a sequence of wfs i , • • • , 3> tt 
such that <2) n is $ and the following four conditions hold: 


1. For each i < n, either 

(a) !3j is an axiom of K, or 

(b) is in T, or 

(c) 9i)i follows by MP or Gen from preceding wfs in the sequence, or 

(d) there is a preceding wf (3x)^(x) such that <2>i is r d(d), where d is a new 
individual constant (rule C). 

2. As axioms in condition 1(a), we also can use all logical axioms that 
involve the new individual constants already introduced in the sequence 
by applications of rule C. 

3. No application of Gen is made using a variable that is free in some 
(3x)^(x) to which rule C has been previously applied. 

4. $ contains none of the new individual constants introduced in the se- 
quence in any application of rule C. 

A word should be said about the reason for including condition 3. If an 
application of rule C to a wf (3x) < <f(x) yields ^(d), then the object referred to 
by d may depend on the values of the free variables in (3x)^(x). So that one 
object may not satisfy ( d(x) for all values of the free variables in (3x)^(x). 
For example, without clause 3, we could proceed as follows: 


1. (Vx)(3y>4j(x,j) Hyp 

2. (3y)Al(x,y) 1, rule A4 

3. A-{(x,d) 2, rule C 

4. (Vx)/4i(x, d) 3, Gen 

5. (3j>)(Vx)4?(x,j>) 4, rule E4 


However, there is an interpretation for which (Vx)(3j/)4j(x, j;) is true but 
(3 j;)(Vx)4j(x,j;) is false. Take the domain to be the set of integers and let 
Aj (x, >>) mean that x < y. 


PROPOSmON 2.10 

If r he then r h Moreover, from the following proof it is easy to 
verify that, if there is an application of Gen in the new proof of from T 
using a certain variable and applied to a wf depending upon a certain wf of 
T, then there was such an application of Gen in the original proof.! 


tThe first formulation of a version of rule C similar to that given here seems to 
be due to Rosser (1953). 




RULE C 


r 



proof 


Let (3y\)%>\(yi), ■ ■ . , {^ykY^kiyk) be the wfs in order of occurrence to which 
rule C is applied in the proof of T he and let di , . . . , d* be the corre- 
sponding new individual constants. Then T, ^i(di), . . . L Now, 

hy condition 3 of the definition above, Corollary 2.6 is applicable, yielding 
r.tfi W>--- j ^ — i i ) h Wc replace dj^ everywhere by a 

variable z that does not occur in the proof. 

Then 


T, Vi (dj), . . . , tf*_i (d*_,) h <?*(z) =* ^ 


and, by Gen, 

r,^(d, ),..., h (Vz)(<?*(z) => 39) 

Hence, by Exercise 2.32(d), 

T,^i(di), . . . h {3yk)^k{yk) => ^ 


But, 




Hence, by MP, 

r,^i(d!),...,^_i(4-i)h^ 

Repeating this argument, we can eliminate • • • ^\{d\) one after 

the other, finally obtaining fl-^. 


Example 

h (Vx)(^(x) #(x)) => ((3x)^(x) => (Tr)^(x)) 


1. (Vx)(^x) => x )) 

2. (3x)^(x) 

3. @{d) 

4. <^(d) c E{d) 

5. ^(d) 

6. (3x)^(x) 

7. (Vx)(^(x) (x) ) , (3x)^(x) h c (3x)^(x) 

8. (Vx)(^(x) =» tf(x)), (3x)^(x) h (3x)^(x) 

9. (Vx)(^(x) =y ^(x)) h (3x)^(x) => (3x)^(x) 

10. h (Vx)(^(x) => ^(x)) ((3x)^(x) => (3x)^(x)) 


Hyp 

Hyp 

2, rule C 

1 , rule A4 

3, 4, MP 
5, rule E4 
1-6 

7, Proposition 2.10 
1 — 8, corollary 2.6 
1 — 9, corollary 2.6 


Exercises 

Use rule C and Proposition 2.10 to prove Exercises 2.38-2.45. 

2.38 h (3x)^(x) f 6(x)) => ((Vx)^(x) (3x)^(x)) 

2.39 h ->(3y)(Vx)(T5(x,y) -nTf(x,x)) 
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2.40 h [(Vx) (A } (x) => A\ (x) V A\ (x) ) A -i (Vx) (A } (x) => A \ (x))] 
=> (3x) (A | (x) Av4j(x)) 

2.41 I- [(3x)^(x)] A [(Yx)^(x)] (3x)(.^?(x) A ^(x)) 

2.42 h (3x)#(x) =* (3x)(#(x) V r 6%x)) 

2.43 h (3x)(3y);^(x,>’) (3 y)(3x)0S(x,y) 

2.44 I- (3 x)(Vy)^(x,y) => (Vy)(3x)^(x,y) 

2.45 F (3x)(^(x) A <&(x)) =* ((3x)0$(x)) A (3x)^(x) 

2.46 What is wrong with the following alleged derivations? 


(a) 1. (3x)^(x) 

2. ®{d) 

3. (3x)V(x) 

4. <g(d) 

5. 9&(d) a <<g{d) 

6. (3x)(^(x) A <g{x)) 

7. (3x)^(x), (3x)^(x) 

F (3x)(.'^(x) A r A(x)) 

(b) 1. (3x)(^(x) =» tf(x)) 

2. (3x)£$(x) 

3. @(d) => <€{d) 

4. d{d) 

5. ( €{d) 

6. (3x)#(x) 

7. (3x)(^(x) =><&(x)), 
(3x)d&(x) F (3x) r A(x) 


Hyp 

1, rule C 

Hyp 

3, rule C 

2, 4, conjunction introduction 
5, rule E4 

1-6, Proposition 2.10 

Hyp 

Hyp 

1, rule C 

2, rule C 

3, 4, MP 
5, rule E4 

H6, Proposition 2.10 


2.7 COMPLETENESS THEOREMS 

* 


We intend to show that the theorems of a first-order predicate calculus K are 
precisely the same as the logically valid wfs of K. Half of this result was proved 
in Proposition 2.2. The other half will follow from a much more general 
proposition established later. First we must prove a few preliminary lemmas. 

If Xj and Xj are distinct, then ^(x,) and 3A(xj) are said to be similar if and 
only if Xj is free for x,- in dA(xi) and 03{xi) has no free occurrences of xj. It is 
assumed here that 3S{xj) arises from 38 (xj) by substituting xj for all free 
occurrences of x,-. It is easy to see that, if 88{xi) and d%(xj) are similar, then x,- 
is free for Xj in &(Xj) and 38 (xj) has no free occurrences of x,-. Thus, if ^(x,-) 
and 38 (xy) are similar, then 38 (xj) and 38(xi) are similar. Intuitively, 38 (xi) 
and dS(xj) are similar if and only if ^(x,-) and .^(xy) are the same except that 
&(xi) has free occurrences of x, in exactly those places where 38 (xj) has free 
occurrences of Xj. 

Example 

(Vx 3 )[^j(xi,X 3 ) Vyt}(xi)] and (Vx 3 )[/ti(x 2 ,x 3 ) V/t{(x 2 )] are similar. 
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LEMMA 2.11 

If Pd(xj) and Pd{xf) are similar, then E (\/Xj)$(xj) <=> (Vxy)^(xy). 

Proof 

|- (\'Xi)${xj) => ${xj ) by axiom (A4). Then, by Gen, E (Vxy)((Vx,)^?(x,) 
&{xj)), and so, by axiom (A5) and MP, I- (Vx,-)^(x,-) => (Vxy)^(xy). Si- 
milarly, E (Vxy)^(xy) => (Vx,)^(x,-). Hence, by biconditional introduction, 
E (VXj)^(x/) <=v (Vxy)^(Xy). 

Exercises 

2.47 If ^(x r ) and ^(xy) are similar, prove that I- (3x,)^(x,-) 44- (3xy)^?(xy). 

2.48 Change of bound variables. If Pd(x) is similar to Pd(y), (Vx)<$(x) is a 
subformula of and ^ is the result of replacing one or more occurrences of 
(Vx)^(x) in r 6 ? by (Vy)^(y), prove that \- r 6 ? <=> r 6 ? '. 

LEMMA 2.12 

If a closed wf of a theory K is not provable in K, and if K 7 is the theory 
obtained from K by adding Pd as a new axiom, then K' is consistent. 

Proof 

Assume EC' inconsistent. Then, for some wf E K / ^ and E K ' Now, 
•“K' ^ (-dg => -ipS) by Proposition 2.1. So, by two applications of MP, 
I-k' ->Pd. Now, any use of Pd as an axiom in a proof in K 7 can be regarded as 
a hypothesis in a proof in K. Hence, Pd Ek -'Pd. Since Pd is closed, we have 
Ek => ~^Pd by Corollary 2.7. However, by Proposition 2.1, 
E k (id => -\p$) => -ipti. Therefore, by MP, Ek -'Pd, contradicting our hy- 
pothesis. 

Exercise 

2.49 If a closed wf PS of a theory K is not provable in K, and if K / is the 
theory obtained from K by adding -$d as a new axiom, then K 7 is consistent. 

LEMMA 2.13 

The set of expressions of a language SP is denumerable. Hence, the same is 
true of the set of terms, the set of wfs and the set of closed wfs. 



Proof 




First assign a distinct positive integer g(u) to each symbol u as follows 1 
g( () = 3, <?()) = 5, ff (,)=7, 0 H=9 , e(+) = ll, e(V) = 13, 0(x t ) = 

13 + 8*, 9 (o t ) = 7 + 81, g(^) = 1 + 8(2''3<), and g(A;) = 3 + 8(2»3*1. 
Then, to an expression UqU\ . . . u, associate the number 2 a ^ 1 ' 0 )3 . . pf" r) ^ 
where pj is the yth prime number, starting with p Q = 2. (Example: the 
number of /!} (*2) is 2 5l 3 3 5 29 7 5 .) We can enumerate all expressions in 
the order of their associated numbers; so, the set of expressions is denu- 
merable. 

If we can effectively tell whether any given symbol is a symbol of then 
this enumeration can be effectively carried out, and, in addition, we can 
effectively decide whether any given number is the number of an expression 
of if. The same holds true for terms, wfs and closed wfs. If a theory K in 
the language if is axiomatic, that is, if we can effectively decide whether 
any given wf is an axiom of K, then we can effectively enumerate the 
theorems of K in the following manner. Starting with a list consisting of the 
first axiom of K in the enumeration just specified, add to the list all the 
direct consequences of this axiom by MP and by Gen used only once and 
with jci as quantified variable. Add the second axiom to this new list and 
write all new direct consequences by MP and Gen of the wfs in this aug- 
mented list, with Gen used only once and with x\ and x-i as quantified 
variables. If at the kt h step we add the /cth axiom and apply MP and Gen to 
the wfs in the new list (with Gen applied only once for each of the variables 
, . . . , x*), we eventually obtain in this manner all theorems of K. However, 
in contradistinction to the case of expressions, terms, wfs and closed wfs, it 
turns out that there are axiomatic theories K for which we cannot tell in 
advance whether any given wf of K will eventually appear in the list of 
theorems. 


DEFINITIONS 

(i) A theory K is said to be complete if, for every closed wf of K, either 
Fk 3$ or Fk. 

(ii) A theory K / is said to be an extension of a theory K if every theorem of 
K is a theorem of K 7 . (We also say in such a case that K is a subtheory 
of K'.) 

PROPOSITION 2.14 (LINDENBAUM’S LEMMA) 

If K is a consistent theory, then there is a consistent, complete extension 
of K. 




COMPLETENESS THEOREMS 


87 


proof 

Let ... be an enumeration of all closed wfs of the language of K, by 

Lemma 2. 1 3. Define a sequence Jo, Ji , h, • • • of theories in the following way. 
j 0 is K. Assume J„ is defined, with n ^ 0 . If it is not the case that Ej„ ->PH n+ i, 
then let J„+i be obtained from J„ by adding PA n , i as an additional axiom. On 
the other hand, if Ej n ->PA„ + \, let J„ + i = J„. Let J be the theory obtained by 
taking as axioms all the axioms of all the J,s. Clearly, J,- +l is an extension of 
j h and J is an extension of all the J,s including Jo = K. To show that J is 
consistent, it suffices to prove that every J, is consistent because a proof of a 
contradiction in J, involving as it does only a finite number of axioms, is also 
a proof of a contradition in some J, . We prove the consistency of the J,s, by 
induction. By hypothesis, Jq = K is consistent. Assume that J, is consistent. 
If J /+ i = J „ then J /+ i is consistent. If J, / J, + i, and therefore, by the defi- 
nition of J I +i,-'^/+i is not provable in J,, then, by Lemma 2.12, J I+ i is also 
consistent. So, we have proved that all the J ; s are consistent and, therefore, 
that J is consistent. To prove the completeness of J, let # be any closed wf of 
K. Then = P8j+i for some j^O. Now, either Ej. ->P$j + ] or Ej y+1 PSj + \, since, 
if it is not the case that Ej. 1 , then PAj+\ is added as an axiom in Jy +1 . 

Therefore, either Ej ->38j+ \ or Ej PAj + \ . Thus, J is complete. 

Note that even if one can effectively determine whether any wf is an 
axiom of IC, it may not be possible to do the same with (or even to enu- 
merate effectively) the axioms of J ; that is, J may not be axiomatic even if K 
is. This is due to the possibility of not being able to determine, at each step, 
whether or not is provable in J H . 

Exercises 

2.49 Show that a theory K is complete if and only if, for any closed wfs PA 
and # of K, if Hk PA V then h K PA or Ek 

2.50° Prove that every consistent decidable theory has a consistent, decid- 
able, complete extension. 

DEFINITIONS 

1. A closed term is a term without variables. 

2. A theory K is a scapegoat theory if, for any wf PA(x) that has jc as its only 
free variable, there is a closed term t such that 

Ek (3.y) ->iA{x) 

LEMMA 2.15 

Every consistent theory K has a consistent extension K 7 such that K 7 is a 
scapegoat theory and K 7 contains denumerably many closed terms. 
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Proof 

Add to the symbols of K a denumerable set { A i , 62 , . . .} of new individual 
constants. Call this new theory IQ. Its axioms are those of K plus those 
logical axioms that involve the symbols of K and the new constants. IQ, j s 
consistent. For, if not, there is a proof in K 0 of a wf $ A ->3d. Replace each 
bi appearing in this proof by a variable that does not appear in the proof. 
This transforms axioms into axioms and preserves the correctness of the 
applications of the rules of inference. The final wf in the proof is still a 
contradiction, but now the proof does not involve any of the fc,s and 
therefore is a proof in K. This contradicts the consistency of K. Hence, IQ 
is consistent. 

By Lemma 2.13, let F\ (x,-, ) , F 2 (x, 2 ) , . .. , F k (xi k ), . . . be an enumeration of 
all wfs of Ko that have one free variable. Choose a sequence ,bj 2 ,... of 
some of the new individual constants such that each b Jk is not contained in 
any of the wfs (*,-,), . . . , F k (x ik ) and such that b Jk is different from each of 
bj n , bj k l . Consider the wf 

(S k ) (3x 4 )^fa) => -^F k (b h ) 

Let IQ be the theory obtained by adding (Si),..., (S„) to the axioms of IQ, 
and let K l30 be the theory obtained by adding all the (.S^s as axioms to IQ. 
Any proof in EC*, contains only a finite number of the (S',-) s and, therefore, 
will also be a proof in some IQ. Hence, if all the IQs are consistent, so is 
IQo. To demonstrate that all the IQs are consistent, proceed by induction. 
We know that K 0 is consistent. Assume that IQ,_i is consistent but that IQ is 
inconsistent (n^l). Then, as we know, any wf is provable in IQ, (by the 
tautology -A (A => B), Proposition 2.1 and MP). In particular, 
Hk„ _, (Q)- Hence, (S„) Fk„_i _ 1 (*Q)- Since (£„) is closed, we have, by Cor- 
ollary 2.7, Hk„_i (S n ) => - , (*Q). But, by the tautology (A => ->A) =>- -A, 
Proposition 2.1 and MP, we then have l-^, -1 (S „ ) ; that is, 
h K„_, -n F n (x i/: ) => ~'F„(bj ll )]. Now, by conditional elimination, we 

obtain \- Kii l (3x in )^F„(x in ) and h Kn , -n^F„ (b jn ) , and then, by negation 
elimination, h K t F„(bj n ). From the latter and the fact that bj n does not occur 
in (So), . .. , (S„- 1 ), we conclude h Kj) _ , F n (x r ), where x r is a variable that does 
not occur in the proof of F„(bj n ). (Simply replace in the proof all occurrences 
of b Jn by x r .) By Gen, Fk,,_i (Vjc,.) F„ (x r ) , and then, by Lemma 2.11 and 
biconditional elimination, h K); , ( Vx, H )F n (x i>t ) . (We use the fact that F„(x r ) 
and F„ (x ir ) are similar.) But we already have I-k^i which is an 

abbreviation of -> (VQ ) ->~>F ll (x iit ) , whence, by the replacement theorem, 
Fjq,., “’(V'-y i„)F„(x in ), contradicting the hypothesis that IQ,_i is consistent. 
Hence, IQ, must also be consistent. Thus IQo is consistent, it is an extension 
of IQ and it is clearly a scapegoat theory. 
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LEMMA 2.16 



Let J be a consistent, complete scapegoat theory. Then J has a model M 
whose domain is the set D of closed terms of J. 


proof 

for any individual constant of J, let («,) M = a,-. For any function letter fj* 
of J and for any closed terms t\,...t n of J, let ■ . . , t„) = 

. . . , t n ). (Notice that fj{t \ , . . . . t„) is a closed term. Hence, (f k ) is an 
ary operation on D.) For any predicate letter A k of J, let {A k ) consist of 
all /t-tuples {/i, . . . , t„) of closed terms ... ,t„ of J such that 
hj . . . , t„). It now suffices to show that, for any closed wf ^ of J: 

(□) |=m ^ if and only if Ej <6 

(If this is established and $ is any axiom of J, let # be the closure of $. By 
Gen, Fj ( C. By (□), \=m ( C- By (VI) on page 61, \=m Hence, M would be a 
model of J.) The proof of (□) is by induction on the number r of con- 
nectives and quantifiers in ( C. Assume that (□) holds for all closed wfs with 
fewer than r connectives and quantifiers. 

Case 1. C is a closed atomic wf A k (^ , . . . ,t„). Then (□) is a direct con- 
sequence of the definition of {A n k ) M . 

Case 2. is ->3. If ^ is true for M, then 3 is false for M and so, by 
inductive hypothesis, not-Fj 3. Since J is complete and 3 is closed, Fj -*3 - 
that is, Fj ( 6. Conversely, if ^ is not true for M, then (A is true for M. Hence, 
Fj 3. Since J is consistent, not-Fj ->3, that is, not-Fj ( C. 

Case 3. ^ is 3 ==> S. Since ^ is closed, so are 3 and S. If *C is false for M, 
then 3 is true and $ is false. Hence, by inductive hypothesis, Fj 3 and not- 
Fj S. By the completeness of J, Fj Therefore, by an instance of the 
tautology D => (~iE => ->{D => E)) and two applications of MP, 
Fj ->(3 =e S'), that is, Fj -AC, and so, by the consistency of J, not-Fj ( C. 
Conversely, if not-Fj < C, then, by the completeness of J, Fj that is, 
Fj —>{3 =s>- S). By conditional elimination, Fj 3 and Fj —>S. Hence, by (□) 
for 3, 3 is true for M. By the consistency of J, not-Fj S and, therefore, by 
(□) for S, $ is false for M. Thus, since 3 is true for M and S is false for M, 
is false for M. 

Case 4. AA is (\lx m )3. 

Case 4a. 3 is a closed wf. By inductive hypothesis, [=m 3 if and only 
if Fj 3. By Exercise 2.32(a), Fj 3 (Yx„, ) 3 . So, Fj 3 if and only if 
Fj (yx,„)3 , by biconditional elimination. Moreover, [=m 3 if and only 
if [=m (Vjc m )3 by property (VI) on page 61. Hence, [=m C if and only if 
Fj ( C. 

Case 4b. 3 is not a closed wf. Since ^ is closed, 3 has x m as its only free 
variable, say 3 is F(x m ). Then r 6 ? is (yx m )F(x m ). 
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(i) Assume |=m ^ and not-bj ( F. By the completeness of J, bj -><#, that is, 

l-j -i(Vx m )F(x m ). Then, by Exercise 2.33(a) and biconditional elimina- 
tion, bj (3x w )-iF(x ot ). Since J is a scapegoat theory, bj ~>F(t) for some 
closed term t of J. But [=m that is, \= M (\fx m )F(x m ). Since 
(Vx m )F(x m ) F(t) is true for M by property (X) on page 63, \= M F(t). 
Hence, by (□) for F(t), bj F(t). This contradicts the consistency of J. 
Thus, if then, bj 

(ii) Assume bj ^ and not-[=M Thus, 

(#) bj (Vx„,)F(x, ,,)(##) not- Hm (Vx m )F(x m ). 

By (##), some sequence of elements of the domain D does not satisfy 
(\fx m )F(x m ). Hence, some sequence s does not satisfy F(x m ). Let t be the fth 
component of s. Notice that s*(u) = u for all closed terms u of J (by the 
definition of (a,) M and Observe also that F(t) has fewer connectives 

and quantifiers than and, therefore, the inductive hypothesis applies to 
F(r), that is, (□) holds for F{t). Hence, by Lemma 2(a) on page 63, s does 
not satisfy F(t). So, F(t) is false for M. But, by (#) and rule A4, bj F(t), and 
so, by (□) for F(t), F(t). This contradiction shows that, if bj then 

\=M 
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Now we can prove the fundamental theorem of quantification theory. 
By a denumerable model we mean a model in which the domain is de- 
numerable. 

PROPOSITION 2.17t 

Every consistent theory K has a denumerable model. 

Proof 

By Lemma 2.15, K has a consistent extension K 7 such that K' is a scapegoat 
theory and has denumerably many closed terms. By Lindenbaum’s lemma, 
K' has a consistent, complete extension J that has the same symbols at K'. 
Hence, J is also a scapegoat theory. By Lemma 2.16, J has a model M whose 
domain is the denumerable set of closed terms of J. Since J is an extension of 
IC, M is a denumerable model of K. 


iThe proof given here is essentially due to Henkin (1949), as simplified by 
Hasenjaeger (1953). The result was originally proved by Godel (1930). Other proofs 
have been published by Rasiowa and Sikorski (1951; 1952) and Beth (1951), using 
(Boolean) algebraic and topological methods, respectively. Still other proofs may be 
found in Hintikka (1955a, b) and in Beth (1959). 
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COROLLARY 2.18 

Any logically valid wf 38 of a theory IC is a theorem of K. 

proof 

We need consider only closed wfs 38, since a wf PA is logically valid if and 
only if its closure is logically valid, and $ is provable in K if and only if its 
closure is provable in K. So, let 38 be a logically valid closed wf of K. 
Assume that not-bK 38. By Lemma 2. 12, if we add ~>38 as a new axiom to IC, 
the new theory K 7 is consistent. Hence, by Proposition 2.17, EC 7 has a model 
M. Since ->38 is an axiom of IC', ->38 is true for M. But, since PA is logically 
valid, 38 is true for M. Hence, 38 is both true and false for M, which is 
impossible (by (II) on page 61). Thus, PA must be a theorem of K. 

COROLLARY 2.19. (GODEL’S COMPLETENESS THEOREM, 1930) 
In any predicate calculus, the theorems are precisely the logically valid wfs. 

Proof 

This follows from Proposition 2.2 and Corollary 2.18. (Godel’s original 
proof runs along quite different lines. For other proofs, see Beth (1951), 
Dreben (1952), Hintikka (1955a, b) and Rasiowa and Sikorski (1951; 1952).) 

COROLLARY 2.20 
Let K be any theory. 

(a) A wf PA is true in every denumerable model of IC if and only if b K 38. 

(b) If, in every model of IC, every sequence that satisfies all wfs in a set T of 
wfs also satisfies a wf 38, then T b K 38. 

(c) If a wf 38 of K is a logical consequence of a set T of wfs of IC, then 

r e k 38. 

(d) If a wf 38 of IC is a logical consequence of a wf # of IC, then ^ I~k 38. 

Proof 

(a) We may assume 38 is closed. If not-bK 38, then the theory 
K 7 = K + {->$?} is consistent.! Hence, by Proposition 2.17, K 7 has a 
denumerable model M. However, ->38, being an axiom of IC', is true for 

tlf K is a theory and A is a set of wfs of K, then K + A denotes the theory 
obtained from K by adding the wfs of A as axioms. 
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M. By hypothesis, since M is a denumerable model of K, 38 is true for 
M. Therefore, 38 is true and false for M, which is impossible. 

(b) Consider the theory K + T. By the hypothesis, 3ft is true for every model 
of this theory. Hence, by (a), \ K+r 38- So, F h K 3ft. 

Part (c) is a consequence of (b), and part (d) is a special case of (c). 

Corollaries 2.18-2.20 show that the ‘syntactical’ approach to quantifi- 
cation theory by means of first-order theories is equivalent to the ‘seman- 
tical’ approach through the notions of interpretations, models, logical 
validity, and so on. For the propositional calculus, Corollary 1.15 demon- 
strated the analogous equivalence between the semantical notion (tautology) 
and the syntactical notion (theorem of L). Notice also that, in the propo- 
sitional calculus, the completeness of the system L (see Proposition 1.14) led 
to a solution of the decision problem. However, for first-order theories, we 
cannot obtain a decision procedure for logical validity or, equivalently, for 
provability in first-order predicate calculi. We shall prove this and related 
results in Section 3.6. 


COROLLARY 2.21. (SKOLEM-L O WENHEIM THEOREM, 1920, 1915) 
Any theory that has a model has a denumerable model. 

Proof 

If K has a model, then K is consistent, since no wf can be both true and false 
for the same model M. Hence, by Proposition 2.17, K has a denumerable 
model. 

The following stronger consequence of Proposition 2.17 is derivable. 
COROLLARY 2.22 A 

For any cardinal number rn^Po, any consistent theory K has a model of 
cardinality rn. 

Proof 

By Proposition 2.17, we know that K has a denumerable model. Therefore, 
it suffices to prove the following lemma. 

LEMMA 

If m and n are two cardinal numbers such that m ^ n and if K has a model of 
cardinality m, then K has a model of cardinality n. 
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proof 

Let M be a model of K with domain D of cardinality m. Let D' be a set of 
cardinality n that contains D. Extend the model M to an interpretation M' 
that has D' as domain in the following way. Let c be a fixed element of D. 
We stipulate that the elements of D' — D behave like c. For example, if B" is 

* f 

the interpretation in M of the predicate letter A ” and (B f j ) is the new in- 
terpretation in M', then for any d \ , . . . , d„ in D', (B"-) 1 holds for (d \ , . . . , d„) if 
and only if B n . holds for («i, . . . , u „ ) , where i/ ( - — dj if d { G D and u { = c if 
D. The interpretation of the function letters is extended in an 
analogous way, and the individual constants have the same interpretations 
as in M. It is an easy exercise to show, by induction on the number of 
connectives and quantifiers in a wf that 28 is true for M' if and only if it is 
true for M. Hence, M' is a model of K of cardinality n. 


Exercises 

2.51 For any theory K, if T h K & and each wf in Y is true for a model M 
of K, show that 28 is true for M. 

2.52 If a wf 28 without quantifiers is provable in a predicate calculus, prove 
that 28 is an instance of a tautology and, hence, by Proposition 2. 1 , has a 
proof without quantifiers using only axioms (A1)-(A3) and MP. [Hint: if 28 
were not a tautology, one could construct an interpretation, having the set 
of terms that occur in 28 as its domain, for which 28 is not true, contradicting 
Proposition 2.2.] Note that this implies the consistency of the predicate 
calculus and also provides a decision procedure for the provability of wfs 
without quantifiers. 

2.53 Show that Ek $ if and only if there is a wf # that is the closure of the 
conjunction of some axioms of K such that ^ ^ 28 is logically valid. 

2.54 Compactness. If all finite subsets of the set of axioms of a theory K 
have models, prove that K has a model. 

2.55 (a) For any wf 28, prove that there is only a finite number of inter- 

pretations of 28 on a given domain of finite cardinality k. 

(b) For any wf 28, prove that there is an effective way of determining 
whether 28 is true for all interpretations with domain of some fixed 
cardinality k. 

(c) Let a wf 28 be called k- valid if it is true for all interpretations that 
have a domain of k elements. Call 28 precisely k-valid if it is k-valid 
but not {k+ l)-valid. Show that (k + Invalidity implies A- validity 
and give an example of a wf that is precisely A:- valid. (See Hilbert 
and Bernays (1934, § 4—5) and Wajsberg (1933).) 

2.56 Show that the following wf is true for all finite domains but is false 
for some infinite domain. 
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(V*)(VjO(Vz)[^(x,a-) A (A 2 i(x,y) A Ar x {y,z) =► A^(x,z)) A (Aj(x,y) VAj(y,x))] 


2.57 Prove that there is no theory K whose models are exactly the inter- 
pretations with finite domains. 

2.58 Let 38 be any wf that contains no quantifiers, function letters, or 
individual constants. 

(a) Show that a closed prenex wf (Vx‘i ) . . . (Vx„)(3yi) . . . (3y m )38, with m ^ () 
and 77 ^ 1 , is logically valid if and only if it is true for every inter- 
pretation with a domain of n objects. 

(b) Prove that a closed prenex wf (3yi) . . . (3y, n )38 is logically valid if and 
only if it is true for all interpretations with a domain of one element. 

(c) Show that there is an effective procedure to determine the logical va- 
lidity of all wfs of the forms given in (a) and (b). 

2.59 Let Kj and IC 2 be theories in the same language . Assume that any 
interpretation M of 3A is a model of Kj if and only if M is not a model of 
K. 2 . Prove that Kj and IC 2 are finitely axiomatizable, that is, there are finite 
sets of sentences T and A such that, for any sentence Fk, 38 if and only if 
rh^, and I-k 2 38 if and only if A F 38J 

2.60 A set r of sentences is called an independent axiomatization of a 
theory K if (a) all sentences in T are theorems of K, (b) T (- 38 for every 
theorem 38 of K, and (c) for every sentence C S of T, it is not the case that 
r — { c d} \~ K Prove that every theory K has an independent axiomatiza- 
tion. 

2.61 A If, for some cardinal Ko, a wf $ is true for every interpretation of 
cardinality m, prove that 38 is logically valid. 

2.62 a If a wf 38 is true for all interpretations of cardinality rn prove that 38 is 
true for all interpretations of cardinality less than or equal to m. 

2.63 (a) Prove that a theory K is a scapegoat theory if and only if, for any 

wf 38 (x) with x as its only free variable, there is a closed term t such 
that Fk (3x)38(x) =>■ 38(t). 

(b) Prove that a theory IC is a scapegoat theory if and only if, for any 
wf 38(x) with x as its only free variable such that Fk (3x)38(x), 
there is a closed term 1 such that Fk 38 {t). 

(c) Prove that no predicate calculus is a scapegoat theory. 


2.8 FIRST-ORDER THEORIES WITH EQUALITY 

Let K be a theory that has as one of its predicate letters A\. Let us write t = s 
as an abbreviation for A^(t, s), and t ^ s as an abbreviation for -v4?(M). 

1 Here, an expression T F : 38 , without any subscript attached to F, means that 38 
is derivable from T using only logical axioms, that is within the predicate calculus. 



1 : FIRST-ORDER THEORIES WITH EQUALITY 

Then K is called a first-order theory with equality (or simply a theory with 
equality) if the following are theorems of K: 

(A6) (Vx'i)xi = x\ (reflexivity of equality) 

(A7) x = y (^?(x,x) => 3tf(x,y)) (substitutivity of equality) 

where x and y are any variables, $(x,x) is any wf, and .4?(x,y) arises from 
$(x,x) by replacing some, but not necessarily all, free occurrences of x by y, 
with the proviso that y is free for x in 38(x,x). Thus, .#(x,y) may or may not 
contain free occurrences of x. 

The numbering (A6) and (A7) is a continuation of the numbering of the 
logical axioms. 
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PROPOSITION 2.23 

[n any theory with equality, 

(a) h t — t for any term t; 

(b) b t = s ^ s — t for any terms t and s ; 

(c) (- / = s => (s = r => t = r) for any terms t, s and r. 


Proof 

(a) By (A6), h (Vxj) X\ = xj. Hence, by rule A4, h t — t. 

(b) Let x and y be variables nol occurring in t or s. Letting &(x, x) be x = x 
and &(x,y) be y = x in schema (A7), b x = y => (x = * =>- y = x) . But, 
by (a), h x = x. So, by an instance of the tautology (A =4> (B => C)) 

(B =>- (A =>■ C)) and two applications of MP, we have b x = y 
=> y = x. Two applications of Gen yield h (Vx)(Vy)(x — y => y — x), 
and then two applications of rule A4 give b / = 5 =?► s = t 

(c) Let x,y and z be three variables not occurring in t,s, or r. Letting 
&(y,y) be y = z and &(y,x) be x — z in (A 7), with x and y inter- 
changed, we obtain by = x=^(y = z=^x = z). But, by (b), 
\- x = y => y = x. Hence, using an instance of the tautology 
(A =3> B) =>• ((B =>■ C) (A =>• C)) and two applications of MP, we 
obtain h x = y ( y = z => x = z). By three applications of Gen, 
b (Vx) (Vy) (Vz) (x = y =>- (y = z =>- x = z)), and then, by three uses of 
rule A4, b t = s =y (s = r => t = r). 


Exercises 

2.64 Show that (A6) and (A7) are true for any interpretation M in which 
Wr is the identity relation on the domain of the interpretation. 

2.65 Prove the following in any theory with equality. 
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(a) b (Vx)(^(a') 45 (3>)(x =j'A £%(y))) if y does not occur in 2 ft { x ) 

(b) b (Vx)(.^(x) 45 (Vy) (x — y =5 2$(y))) if y does not occur in 2B(x) 

(c) b (Vx')(3y) a- = y 

(d) b x = y =5 f(x) —f(y), where f is any function letter of one argument 

(e) b 2ti(x) Ax = y =5 &(y), if y is free for x in M[x) 

(f) b <2#(x) A -i &&(y) =5 x ^ y, if y is free for x in 98 (x) 

We can reduce schema (A7) to a few simpler cases. 


PROPOSITION 2.24 

Let K be a theory for which (A6) holds and (A7) holds for all atomic wfs 
38(x,x) in which there are no individual constants. Then K is a theory with 
equality, that is, (A 7) holds for all wfs 98(x,x). 


Proof 

We must prove (A7) for' all wfs 98{x,x). It holds for atomic wfs by as- 
sumption. Note that we have the results of Proposition 2.23, since its proof 
used (A7) only with atomic wfs without individual constants. Note also that 
we have (A7) for all atomic wfs 98{x,x). For if 98(x,x) contains individual 
constants, we can replace those individual constants by new variables, ob- 
taining a wf 98* (a - , a - ) without individual constants. By hypothesis, the cor- 
responding instance of (A7) with 98* (jc, v) is a theorem; we can then apply 
Gen with respect to the new variables, and finally apply rule A4 one or more 
times to obtain (A7) with respect to 98(x,x). 

Proceeding by induction on the number n of connectives and quantifiers 
in &(x,x), we assume that (A7) holds for all k < n. 

Case 1. 98(x,x) is —> ( &(x,x). By inductive hypothesis, we have 
by — x =5 (&(x ,y) =5 r 8(x,x)), since ^(x,x) arises from %>(x ,y) by replacing 
some occurrences of y by x. Hence, by Proposition 2.23(b), instances of the 
tautologies (A =5 6) => {—>98 =5 —>A) and ( A =5 B) => {{B =5 C) =5 (A =>- C)) 
and MP, we obtain b x = y =5 (98(x,x) =5 9?(x,y)). 

Case 2. 2fl(x,x) is =5 29(x,x). By inductive hypothesis and Pro- 

position 2.23(b), b x — y = 5 - (^(x,y) =5 c 8{x,x)) and b x =y =5- (&(x,x) 
=5 @(x,y)). Hence, by the tautology (A =5 {C\ =^C))b [(A =5 (D =5 D \ )) 
=5 (A =5 {(C =5 0) =5 (Cj =5 a)))], we have b x = y =5 {28{x, x) =5 98{x,y)). 

Case 3. 98{x,x) is (Vz) r A(x, a, z). By inductive hypothesis, b x = y=b 
( c 9(x,x,z) =5 < £(x,y,z)). Now, by Gen and axiom (A5), hr = y =5 
(Vz)(#(x;,jc,z) =5 < ^ ? (A‘,y,z)). By Exercise 2. 27(a),b (Vz) (^(x,x,z) =5 ^(x,y,z)) 
=5 [(Vz)^(a^ :,x,z) =5 (Vz)#(x,y,z)], and so, by the tautology (>4 B) => 
(( B = 5 - C) =5 (A =5 C)), b x = y =5 (98(x,x) =5 98(x,y)). 

The instances of (A7) can be still further reduced. 
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PROPOSITION 2.25 

Let K be a theory in which (A6) holds and the following are true. 

( a ) Schema (A7) holds for all atomic wfs ^(x,x) such that no function 
letters or individual constants occur in 3fl(x, x) and &(x,y) comes from 
2§(x, x) by replacing exactly one occurrence of x by y. 

(b) h x = y =>• f”{z \ , . . • , z„) w„), where fj is any function 

letter of K, z\ , ...,z„ are variables, and fj{w \ , . . . , w„) arises from 
fj{z\ , . . . ,z„) by replacing exactly one occurrence of x by y. 

Then IC is a theory with equality. 


Proof 

By repeated application, our assumptions can be extended to replacements 
of more than one occurrence of x by y. Also, Proposition 2.23 is still deri- 
vable. By Proposition 2.24, it suffices to prove (A7) for only atomic wfs 
without individual constants. But, hypothesis (a) enables us easily to prove 

E z\ A . . . f\y„ Z/j) (&{ j ? i , • • • 7 Xn ) • • • , 

for all variables yj, z i, ... ,z„ and any atomic wf . . . ,y„) 

without function letters or individual constants. Hence, it suffices to show: 

(*) If t(x,x) is a term without individual constants and /(x,j>) comes from 
/(x,x) by replacing some occurrences of x by y, then \~ x — y -> /(x,x) 
= t(x,y). t 

But (*) can be proved, using hypothesis (b), by induction on the number of 
function letters in f(x,x), and we leave this as an exercise. 

It is easy to see from Proposition 2.25 that, when the language of K has 
only finitely many predicate and function letters, it is only necessary to 
verify (A7) for a finite list of special cases (in fact, n wfs for each A" and n 
wfs for each f"). 


Exercises 

2.66 Let Kj be a theory whose language has only = as a predicate letter 
and no function letters or individual constants. Let its proper axioms be 
(Vxi ) X] = Xj , (Vxi ) (VX2) (xi =x*2 => X2 ~x'i) and (Vxi)(Vx'2)(Vx3)(xi — X2 
(x 2 = X3 =4- Xi = X3)). Show that Kj is a theory with equality. [Hint: It 


Efhe reader can clarify how (*) is applied by using it to prove the following 
instance of (A7): Hx=y=> (A } ( / ^ (x) ) => ^|(/] 1 (3 ; )))- Let t(x,x) be f/(x) and let 
t(x,y) be Z, 1 ( y) . 
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suffices to prove that F X) = x 3 =$► (xi — X2 => *3 = X2) and F X2 = a 3 
(xi = x'2 => a'i — x 3 ).] K t is called the pure first-order theory of equality, 

2.67 Let K2 be a theory whose language has only = and < as predicate 
letters and no function letters or individual constants. Let K2 have the 
following proper axioms. 

(a) (Vxi)xi=x'i 

(b) (Vxi ) (VX2) (x'i = X2 => X2 = *1 ) 

(c) (VX] ) (Va' 2 ) (Vx'3 ) (a'i = A'2 (a'2 = X 3 => X\ = X3)) 

(d) (Vxi)( 3 x 2 )( 3 a- 3 )(xi < X2 A X3 < A'i ) 

(e) (Vxj ) (Va'2 ) (Ya‘3 ) (a'i < A‘2 A A‘2 < X3 => A'l < X3) 

(f) (Vaj )(Ya'2)(a'| = A '2 =>- _l A'l < A‘2) 

(g) (Va- 1 )(Ya‘2)(a‘, < A'2 V Xi = A'2 V A'2 < A'l ) 

(h) (Vai)(Va'2)(ai < a'2 => ( 3 a 3 )(ai < a 3 A a 3 < X2)) 

Using Proposition 2 . 25 , show that K2 is a theory with equality. K2 is called 
the theory of densely ordered sets with neither first nor last element. 

2.68 Let K be any theory with equality. Prove the following. 

(a) h Aj = y\ A ... A A/; = y„ => t(x 1 , . . . , x„) = t( yi , . . . ,y„), where 

t(yi,...,y„) arises from the term /(a'j, . . . ,x„) by substitution of 
y \ , . . . , y n for a'i , , A n , respectively. 

(b) h Xi = pi A ... A x„ = y„ {&{x\ , . . . , x n ) <=> &(yi, . . . ,y „ )), where 

yi , . . . , y „ ) is obtained by substituting y\,. . . ,y„ for one or more 
occurrences of xi,...,x„, respectively, in the wf ^(xt, . . . , x„), and 
j’l, . . . ,y„ are free for xi, . . . ,x„, respectively, in the wf ^(xi, . . . ,x H ). 

Examples. 

(In the literature, ‘elementary’ is sometimes used instead of ‘first-order’.) 

1 . Elementary theory G of groups: predicate letter =, function letter / 2 , and 
individual constant . We abbreviate / 2 ( t,s ) by t + s and a\ by 0 . The 
proper axioms are the following. 

(a) xi -F (x 2 +a 3 ) — (xi +A2) +A3 

(b) xi + 0 = Aj 

(c) (Va'i ) ( 3 a 2 )x, + A‘2 — 0 

(d) A'l = Xi 

(e) Xi = A‘2 => A'2 = A'l 

(f) A'l = A'2 => (A 2 = A'3 => A'l = A3) 

(g) A'i = A '2 => (Xi + A3 = X2 + X 3 A A3 + Ai = A 3 + A'2) 

That G is a theory with equality follows easily from Proposition 2 . 25 . If 
one adds to the axioms the following wf: 

(h) Aj + A'2 — A'2 + At 

the new theory is called the elementary theory of abelian groups. 

2 . Elementary theory F of fields: predicate letter =, function letters / 2 and 
/ 2 , and individual constants a\ and 02. Abbreviate / 2 (t,j) by 
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t + s, fjit) s') by t ■ s, and ai and a 2 by 0 and 1. As proper axioms, take 
(a)-(h) of Example 1 plus the following. 

(i) A] = A " 2 =» (aTj • X 3 =X2‘ A 3 A A 3 • A'j = A 3 • X 2 ) 

(j) Al ’ (A 2 • A 3 ) = (Ai • A 2 ) • A 3 

(k) Ai • (a 2 +a 3 ) = (ai • a 2 ) 4- (aj -a 3 ) 

(l) Al • A 2 = A '2 • Al 

(m) Al ■ 1 = Al 

(n) Ai 0 => ( 3 ^ 2 ) A 1 • A 2 — 1 

(o) 0 ^ 1 

F is a theory with equality. Axioms (a)-(m) define the elementary theory 
R c of commutative rings with unit. If we add to F the predicate letter A\, 
abbreviate A^(t,s) by t < s, and add axioms (e), (f) and (g) of 
Exercise 2.67, as well as xj < X 2 =y Ai + X 3 < A 2 +X 3 and 
xi < A 2 A 0 < X 3 =y Xi • X 3 < a '2 • A 3 , then the new theory F< is called the 
elementary theory of ordered fields. 

Exercise 

2.69 (a) What formulas must be derived in order to use Proposition 2.25 to 

conclude that the theory G of Example 1 is a theory with equality? 
(b) Show that the axioms (d)-(f ) of equality mentioned in Example 1 
can be replaced by (d) and 

(f '): Al = A '2 => (A 3 = A '2 => A'l = X 3 ). 

One often encounters theories K in which = may be defined; that is, there 
is a wf £(x,y) with two free variables x and y, such that, if we abbreviate 
${t, s) by t — s, then axioms (A 6 ) and (A7) are provable in K. We make the 
convention that, if t and s are terms that are not free for x and y, respec- 
tively, in £(x,y), then, by suitable changes of bound variables (see Exercise 
2.48), we replace (x,p) by a logically equivalent wf £*{x,y) such that t and s 
are free for x and y, respectively, in £* (x, >’) ; then t — s is to be the ab- 
breviation of $*{t,s). Proposition 2.23 and analogues of Propositions 2.24 
and 2.25 hold for such theories. There is no harm in extending the term 
theory with equality to cover such theories. 

In theories with equality it is possible to define in the following way 
phrases that use the expression ‘There exists one and only one x such 
that. . 


DEFINITION 

(3ix)^?(x) for (3x)^(x) A (Vx)(Vv)(^(a) A &(y) => x =>>) 

In this definition, the new variable y is assumed to be the first variable that 
does not occur in A similar convention is to be made in all other 

definitions where new variables are introduced. 
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Exercise 

2.70 In any theory with equality, prove the following. 

(a) F (Vx)(3iy) x = y 

(b) F (3ix)^(x) yy (3x) (Vy) (x = y yy >’)) 

(c) F (Vx)(^(x) yy ^(x)) [(3ix)^(x) yy (3ix)^(x)] 

(d) F (3ix)(^ V *Jf) ((3ix)J?) V (3ix)^ 

(e) F (3ix)^(x) ^ (3x)(^(x) A (Vj>)(#(j>) = x)) 

In any model for a theory K with equality, the relation E in the model 
corresponding to the predicate letter = is an equivalence relation (by Pro- 
position 2.23). If this relation E is the identity relation in the domain of the 
model, then the model is said to be normal. 

Any model M for K can be contracted to a normal model M* for K by 
taking the domain D* of M* to be the set of equivalence classes determined 
by the relation E in the domain D of M. For a predicate letter A" and for any 
equivalence classes [Aj], . . . , [A„] in D* determined by elements by,... ,b„ in 
D, we let ( A'j) M hold for ([Ai [A,,]) if and only if (A'j) M holds for 
(by, ... ,b„). Notice that it makes no difference which representatives 
by , . . . , A„ we select in the given equivalence classes because, from (A7), 
F xj = yy A . . . A x„ = y„ -> (A"(x i, . . . ,x„) yy A*(y i, . . . ,y„)). Likewise, for 
any function letter JJ and any equivalence classes [Ai], . . . , [b„\ in D*, let 
(fj) M ([&ij, - - - , [&»]) = Again note that this is in- 

dependent of the choice of the representatives by, . . . ,b„, since, from (A7), 
we can prove F Xi = yy A . . . A x* = y„ => //(xi , . . . ,x„) = fj(y x ,... ,y„). 
For any individual constant a,- let = [(tf,) M ]. The relation E* corre- 

sponding to = in the model M* is the identity relation in D*: E*([Aj], [A 2 ]) if 
and only if E(Aj , bj), that is, if and only if [Ai] •= [A 2 ] - Now one can easily 
prove by induction the following lemma: If s = (Ai , A 2 , . . .) is a denumerable 
sequence of elements of D, and s' = ( [Aj ] , [A 2 ], . . .) is the corresponding se- 
quence of equivalence classes, then a wf Aft is satisfied by s in M if and only if 
is satisfied by s' in M*. It follows that, for any wf 2ft, 2ft is true for M if and 
only if Aft is true for M*. Hence, because M is a model of IC, M* is a normal 
model of K. 


PROPOSITION 2.26 (EXTENSION OF PROPOSITION 2.17) 

(Godel, 1930) Any consistent theory with equality K has a finite or denu- 
merable normal model. 

Proof 

By Proposition 2.17, K has a denumerable model M. Hence, the contraction 
of M to a normal model yields a finite or denumerable normal model M* 
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because the set of equivalence classes in a denumerable set D is either finite 
or denumerable. 
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COROLLARY 2.27 (EXTENSION OF THE SKOLEM-LOWENHEIM 
THEOREM) 

Any theory with equality K that has an infinite normal model M has a 
denumerable normal model. 


Proof 

Add to K the denumerably many new individual constants b \ , 62 > ■ . . together 
with the axioms b,- 7 ^ bj for / ^ j. Then the new theory K 7 is consistent. If K 7 
were inconsistent, there would be a proof in IC' of a contradiction # A -$?, 
where we may assume that ^ is a wf of K. But this proof uses only a finite 
number of the new axioms: b ix ^ bj x , . . . , h, n 7 ^ b ln . Now, M can be extended 
to a model M# of K plus the axioms h,-, 7 ^ bj x , . . . , b ln 7 ^ b Jn ; in fact, since M is 
an infinite normal model, we can choose interpretations of b ix , b lx , . . . , bj n , bj n , 
so that the wfs b ix 7 ^ b fx , . . . , bj lt / bj n are true. But, since A -f& is derivable 
from these wfs and the axioms of K, it would follow that ^ A is true for 
which is impossible. Hence, K 7 must be consistent. Now, by Proposi- 
tion 2.26, IC' has a finite or denumerable normal model N. But, since, for 
\ yh /, the wfs h t 7 ^ bj are axioms of IC', they are true for N. Thus, the 
elements in the domain of N that are the interpretations of b \ , 62 , . . . must be 
distinct, which implies that the domain of N is infinite and, therefore, de- 
numerable. 


Exercises 

2.71 We define (3„x)2tf(x) by induction on n^\. The case n = 1 has al- 
ready been taken care of. Let (3 II+ \x)&(x) stand for 
( 3 \y)(&(y) A(3„x)(x ^ y A 28 {x))). 

(a) Show that ( 3 n x)28{x ) asserts that there are exactly 77 objects for which '28 
holds, in the sense that in any normal model for (3„x)28(x) there are 
exactly n objects for which the property corresponding to 28 (x) holds. 

(b) (i) For each positive integer 77 , write a closed wf 28 „ such that 28 n is 

true in a normal model when and only when that model contains at 
least 77 elements. 

(ii) Prove that the theory K, whose axioms are those of the pure theory 
of equality Kj (see Exercise 2.66), plus the axioms 28\,282, • • •, is 
not finitely axiomatizable, that is, there is no theory K 7 with a finite 
number of axioms such that K and K 7 have the same theorems. 
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(iii) For a normal model, state in ordinary English the meaning of 

~ 1 ^H+ 1 - 

(c) Let n be a positive integer and consider the wf (<?„) (3„x)x = x. Let L„ 
be the theory Kj +{$„}■> where Ki is the pure theory of equality. 

(i) Show that a normal model M is a model of L„ if and only if there 
are exactly n elements in the domain of M. 

(ii) Define a procedure for determining whether any given sentence is a 
theorem of L M and show that L„ is a complete theory. 

2.72 (a) Prove that, if a theory with equality K has arbitrarily large finite 

normal models, then it has a denumerable normal model. 

(b) Prove that there is no theory with equality whose normal models 
are precisely all finite normal interpretations. 

2.73 Prove that any predicate calculus with equality is consistent. (A 
predicate calculus with equality is assumed to have (A1)-(A7) as its only 
axioms.) 

2.74 d Prove the independence of axioms (A1)-{A7) in any predicate cal- 
culus with equality. 

2.75 If 28 is a wf that does not contain the = symbol and 28 is provable in a 
predicate calculus with equality IC, show that 28 is provable in K without 
using (A6) or (A7). 

2.76 d Show that = can be defined in any theory whose language has only a 
finite number of predicate letters and no function letters. 

2.77 (a) A Find a non-normal model of the elementary theory of groups G. 
(b) Show that any model M of a theory with equality K can be ex- 
tended to a non-normal model of IC. [Hint: Use the argument in 
the proof of the lemma within the proof of Corollary 2.22.] 

2.78 Let 28 be a wf of a theory with equality. Show that 28 is true in every 
normal model of IC if and only if I~k 28. 

2.79 Write the following as wfs of a theory with equality. 

(a) There are at least three moons of Jupiter. 

(b) At most two people know everyone in the class. 

2.80 If P(u) means it is a person , G(«, v) means it is a grandparent of v, and 
u = v means that u and v are identical, translate the following wf into or- 
dinary English: 


(Vx) (P(x) ( 3.vi ) ( 3x 2 ) ( =Ly 3 ) ( 3x 4 ) (x i f x 2 A xi / A 3 A a-] ^ X 4 A 

A'2 f X3 A X2 7^ X4 A A'3 f A'4 A G(a'1 , A') A G( X 2 , A') A G(a - 3 , A') A 
G(a' 4 , A-) A (Vy)(G( y, x) y = x\ V y - x 2 V y = x 3 V y — x 4 )) ) 


2.81 Consider the wf 

(*) (Va-)(V;>)(3z)(z f a- Az fyAA(z)). 

Show that (*) is true in a normal model M of a theory with equality if and 
only if there exist in the domain of M at least three things having property 
A(z). 
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2.82 Let the language Jz? have the four predicate letters =, P, S and L. 
j^ e ad u = v as u and v are identical, P(u) as u is a point, S(u ) as it is a line, and 
l(u, v) as u lies on v. Let the theory of equality <G of planar incidence geo- 
metry have, in addition to axioms (A1)-(A7), the following non-logical 

axioms. 
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(1) F(x) ^ ~>S{x) 

(2) L(x,y) => P(x) AS(y) 

(3) S(x) => (3j-')(3z)(j> ^ z A L(y,x) AL(z,x)) 

(4) fWAP(j') (3iz)(S(z) Al(x.z) A L(y,z)) 

(5) (3x)(3y)(3z)(P(x) A P(y) A P(z) A ^(x,y,z)) 

where ( 8{ x,y,z ) is the wf (3u)(S(it) AL(x,u) A L(y,u) AL(z,u)), which is 
read as x, y, z are coll in ear. 

(a) Translate (l)-(5) into ordinary geometric language. 

(b) Prove F® (V«)(Vt>) (S(u) AS(v) A it ^ v (\/x) (Vy) (L(x,u) A L(x, v)A 
L(y,u) AL(y,u) => x = y)), and translate this theorem into ordinary 
geometric language. 

(c) Let R(u,v) stand for S(u) A S(v) A -i(3w)(Z(w, u) A L(w, v)). Read 
R(it, v ) as u and v are distinct parallel lines. 

(i) Prove: F® R(u , v) it ^ v 

(ii) Show that there exists a normal model of (G with a finite domain in 
which the following sentence is true: 

(Vx)(Vy)(5(x) A P(y) A ^L(y,x) => (3,z)(L(y,z) Ai?(z,*))) 

(d) Show that there exists a model of (G in which the following sentence is 
true: 

(Vx)(\/y)(S(x) A S(y) Ax ?y=> - R(x,y )) 


2.9 DEFINITIONS OF NEW FUNCTION LETTERS 
AND INDIVIDUAL CONSTANTS 


In mathematics, once we have proved, for any yi,. . . ,y„, the existence of a 
unique object u that has a property 8ft(u,yi, . .. ,y n ), we often introduce a 
new function letter f{y\,.-.,y„) such that 8ft(f(y\,...,y n ),y\,... , y „ ) holds 
for all )’j , . . . , y n . In cases where we have proved the existence of a unique 
object u that satisfies a wf 8ft{u) and 8ft {it) contains u as its only free variable, 
then we introduce a new individual constant b such that 8ft (b) holds. It is 
generally acknowledged that such definitions, though convenient, add 
nothing really new to the theory. This can be made precise in the following 
manner. 
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PROPOSITION 2.28 

Let K be a theory with equality. Assume that F K (3j . . , y„ ) . Let 
K # be the theory with equality obtained by adding to IC a new function letter 
/ of n arguments and the proper axiom 38(f(yi, • • ■ • • • ,y n )? as well 

as all instances of axioms (A1)-(A7) that involve /. Then there is an effective 
transformation mapping each wf ( £ of K # into a wf of K such that: 

(a) If / does not occur in c £, then is c £. 

(b) (-.^) # is 

(c) iff => 9)* is <£* ^ 9*. 

(d) ((Vx/F))^ is (Vxr)(^ # ). 

(e) F K# (^<^ # ). 

(f) If F k# then F K V*. 

Hence, if ^ does not contain / and F K # then Fk c £- 


Proof 

By a simple f-term we mean an expression /(q , . . . , t„) in which q , . . . , are 
terms that do not contain /. Given an atomic wf ( £ of K ^ , let be the 
result of replacing the leftmost occurrence of a simple term /"(q ,.../„) in ^ 
by the first variable v not in <£ or Call the wf (3v)(38(v, q , . . . , t n ) A <&*) 
the /- transform of r 6\ If r £ does not contain /, then let r £ be its own f~ 
transform. Clearly, F K # (3v)(&(v, q, . . . , t„) A c £*) c £. (Here, we use 
h K (3iu).^(u,yu • • • ,T«) and the axiom -^(/( ji, • • • • ■ • ,Tn) of K # .) 

Since the /-transform ( £' of ^ contains one less / than ( £ and F K # c £' ^ if 
we take successive /-transforms, eventually we obtain a wf that does not 
contain / and such that F K # ( £. Call the f-less transform of ( £. 

Extend the definition to all wfs of by letting (-iS)^ be -i (£$?#), => S) # 

be =y and ((Vx)^)^ be (Vx)^ # . Properties (a)-(e) of Proposition 
2.28 are then obvious. To prove property (f), it suffices, by property (e), to 
show that, if ( £ does not contain / and F K # </, then h K # ( £. We may assume 
that ^ is a closed wf, since a wf and its closure are deducible from each 
other. 

Assume that M is a model of K. Let Mi be the normal model obtained by 
contracting M. We know that a wf is true for M if and only if it is true for 
Mi. Since Fk (3im)^?(m,ti, < • • ,y«), then, for any b \, . . . ,b„ in the domain of 
Mj , there is a unique c in the domain of Mi such that [=Mi &[c, b \ , . . . , b„\. If 
we define/ [b \ , . . . , b„) to be c, then, taking / to be the interpretation of the 
function letter /, we obtain from Mj a model M^ of Ka. For the logical 
axioms of K# (including the equality axioms of K ^ ) are true in any normal 


tit is better to take this axiom in the form (Vm)(w = f(y\ , ,y„) 
${u,y \, . . . , 3 ',,)), since f(yi , . . . , ) might not be free for u in £S(u,y \ , . . . ,y„). 


• m ■ 
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interpretation, and the axiom <#(/( y \ , • . . ,y„),yi , - - • ,y,i) also holds in 
hy virtue of the definition of /j . Since the other proper axioms of K# do not 
contain / and since they are true for Mi, they are also true for M^. But 
l_ # ( S , Therefore, is true for , but since ( S does not contain /, ^ is true 

for Mi and hence also for M. Thus, is true for every model of K. 
Therefore, by Corollary 2.20(a), Hk (In the case where Fk (3j u)0S{ii) and 
${ii) contains only u as a free variable, we form K# by adding a new 
individual constant b and the axiom $(b). Then the analogue of Proposition 
2.28 follows from practically the same proof as the one just given.) 
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Exercise 

2.83 Find the /-less transforms of the following wfs. 

(a) (Vx)(3/)(^(x,p,/(x,pi , . . . ,;;„)) /(p,x, . . . ,x) = x) 

(b) A\{f{y u . . . ,y„-i,/(Ti, . . . A (3 x)^(x,/(ti, • . . ,t„)) 

Mote that Proposition 2.28 also applies when we have introduced several 
new symbols / , . . . ,f„ because we can assume that we have added each / to 
the theory already obtained by the addition of/i , . . . i; then m successive 
applications of Proposition 2.28 are necessary. The resulting wf ^ of K can 
be considered an (/, . . . ,/ OT )-free transform of # into the language of K. 

Examples 

1. In the elementary theory G of groups, one can prove (3i_y) x + y = 0. 
Then introduce a new function f of one argument, abbreviate f(t) by 
(— r), and add the new axiom x + (— x) = 0. By Proposition 2.28, we now 
are not able to prove any wf of G that we could not prove before. Thus, 
the definition of (—/) adds no really new power to the original theory. 

2. In the elementary theory F of fields, one can prove that 
(3,y)((x^0 A x • y = 1) V (x = 0 and y = 0)). We then introduce a new 
function letter g of one argument, abbreviate g(t) by f 1 , and add the 
axiom (jc^OAx- x _1 = 1) V (x = 0 and x -1 = 0), from which one can 
prove x /= 0 =y x • x -1 = 1 . 

From Proposition 2.28 we can see that, in theories with equality, only 
predicate letters are needed; function letters and individual constants are 
dispensable. If is a function letter, we can replace it by a new predicate 
letter A% +1 if we add the axiom [3iu)A^ +l {u,yi , . . . ,y„). An individual con- 
stant is to be replaced by a new predicate letter A\ if we add the axiom 

(3iw)4.(w). 

Example 

In the elementary theory G of groups, we can replace + and 0 by predicate 
letters A\ and A\ if we add the axioms (Vxi)(Vx 2 ) (3iX3)>4f(xi,X2,X3) and 
(3iXi)yfJ(xi), and if we replace axioms (a), (b), (c) and (g) by the following: 
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(a') A\ (x 2 , X3 , u) A A \ (xi , it, v ) A A\ (xi , x 2 , w) A A\ (w, *3 , y) => v = y 

(b') A\ (y) A A\(x,y,z) =>z=x 

(c') (3>-)(Va)(V/;)(/!] (it) A A](x,y, v) ^ v = it) 

(g') [a'i = x 2 A A\ (X] , y, z) A A] (x 2 , y, u) A A\ ( y, x, , v) A A\ ( y, x 2 , w)] 

=> z = u A v — w 

Notice that the proof of Proposition 2.28 is highly non-constructive, since 
it uses semantical notions (model, truth) and is based upon Corollary 
2.20(a), which was proved in a non-constructive way. Constructive syntac- 
tical proofs have been given for Proposition 2.28 (see Kleene, 1952, § 74), 
but, in general, they are quite complex. 

Descriptive phrases of the kind ‘the it such that 9S(u,y \, . . . ,y„y are very 
common in ordinary language and in mathematics. Such phrases are called 
definite descriptions. We let iu(88(it,yi , . . . ,y„)) denote the unique object u 
such that 88(11, y \, . . . ,y„) if there is such a unique object. If there is no such 
unique object, either we may let ni(88(ii,yi, . . . ,y„)) stand for some fixed 
object, or we may consider it meaningless. (For example, we may say that 
the phrases ‘the present king of France’ and ‘the smallest integer’ are 
meaningless or we may arbitrarily make the convention that they denote 0.) 
There are various ways ofincorporating these /-terms in formalized theories, 
but since in most cases the same results are obtained by using new function 
letters or individual constants as above, and since they all lead to theorems 
similar to Proposition 2.28, we shall not discuss them any further here. For 
details, see Hilbert and Bernays (1934) and Rosser (1939; 1953). 

2.10 PRENEX NORMAL FORMS 

A wf (Qiyi)-. ■ ■ (Q„y„)88, where each (Qijfi is either (V>y) or (3y,-),p 7 - is dif- 
ferent from )>j for i fi j, and 88 contains no quantifiers, is said to be in prenex 
normal form. (We include the case n — 0, when there are no quantifiers at 
all.) We shall prove that, for every wf, we can construct an equivalent prenex 
normal form. 

LEMMA 2.29 

In any theory, if y is not free in 88, and c d(x) and r f(y) are similar, then the 
following hold. 

(a) h ((Vx)V(x) => 0) (3y)(V(y) => 0) 

(b) F ((3x)%(x) =>£>)^ (\/y)(<#(y) => 

(c) F (8$ =y (Vx)Y’(x)) (Vy ) (88 =4> r d(y)) 

(d) F (88 => (3x)<g(x)) ^ (3y)(® => <${y)) 

(e) F -.(Vx)# (3x)^ 

(f) I — ■(3x) < £ o (Vx)-i^ 
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for part (a): 

1. (Vx)^(x) =r* s> 

2. A3y)Wy) => @) 

3. => S) 

4. (Vf)-i(^(f) => 

5. (V>’)(^(.y) A-*^) 

6. #(p) A 

7. tf(j) 

8. (Vp)^(f) 

9. (Vx)^(x) 

10. ^ 

11. 

12 . 

13. (Vx)#(x) =S> 

— i(3j>)(^(j-’) iS*) f S A — 

14. (Vx)^(x) => <3 

h (3y)(<f(j,) =* S) 

15. f (Vx)^(x) => 0 

=* (3y) (<?(>>) =4> &) 

The converse is proven in the fol 

1. mWy) =» 

2. (V*)#(x) 

3. <${b) => S 

4. ^(h) 

5. s> 

6. (3y(<?( p) => S), 

(Vx')^(v) He S) 

7- (3 \y)my)=>&), 

(Vx)^(x) h ^ 

8. h (3j,)(^(3’) =» Si) 

=$■ ((Vx)^(x) 


Hyp 

Hyp 

2, abbreviation 

3, negation elimination 

4, tautology, Proposition 2.9(c) 

5, rule A4 

6, conjunction elimination 

7, Gen 

8, Lemma 2.11, biconditional 
elimination 

l, 9, MP 

6, conjunction elimination 
10, 11, conjunction introduction 

1 12 

1 1 3, proof by contradiction 
1-14, Corollary 2.6 
ing manner. 

Hyp 

Hyp 

1, rule C’ 

2, rule A 4 

3, 4, MP 

1-5 

6, Proposition 2.10 
1-7, Corollary 2.6 twice 


Part (a) follows from the two proofs above by biconditional introduction. 
Parts (b)-(f) are proved easily and left as an exercise. (Part (f) is trivial, and 
(e) appeared as Exercise 2.33(a); (c) and (d) follow easily from (b) and (a), 
respectively.) 

Lemma 2.29 allows us to move interior quantifiers to the front of a wf. 
This is the essential process in the proof of the following proposition. 




QUANTIFICATION THEORY 


108 


PROPOSITION 2.30 

There is an effective procedure for transforming any wf 38 into a wf in 
prenex normal form such that 38 


Proof 

We describe the procedure by induction on the number k of occurrences of 
connectives and quantifiers in 38. (By Exercise 2.32(a, b), we may assume 
that the quantified variables in the prefix that we shall obtain are distinct.) If 
k — 0, then let ^ be 38 itself. Assume that we can find a corresponding ( S for 
all wfs with k < n, and assume that 38 has n occurrences of connectives and 
quantifiers. 

Case 1. If 38 is -i <2>, then, by inductive hypothesis, we can construct a 
wf S in prenex normal form such that h 33 yy S . Hence, I — yy -i S by 
biconditional negation. Thus, h J<^> ->S, and, by applying parts (e) and 
(f) of Lemma 2.29 and the replacement theorem (Proposition 2.9(b)), we 
can find a wf ^ in prenex normal form such that I — >S yy ( S. Hence, 
h 38 yy . 

Case 2. If 38 is 38 =3 S, then, by inductive hypothesis, we can find wfs 38\ 
and Si in prenex normal form such that h 38 yy 38 \ and h S yy S \. Hence, 
by a suitable tautology and MP, h (^ => S) yy (3$i =3 S\), that is, 
h $ yy {38 1 =3 S\). Now, applying parts (a)-(d) of Lemma 2.29 and the 
replacement theorem, we can move the quantifiers in the prefixes of 38\ and 
S\ to the front, obtaining a wf f in prenex normal form such that 

h ®yy ( s. 

Case 3. If 38 is (Vx)^, then, by inductive hypothesis, there is a wf in 
prenex normal form such that h 38 yy 38 1 \ hence, h 38 yy ( Vx) 38 1 by Gen, 
Lemma 2.8, and MP. But (Vx).S'i is in prenex normal form. 


Examples 

1. Let ;38 be (Vx) (/I j (x) =3 (Vy) (Al(x,y) =3 -i(Vz)v43(y,z))). By part (e) of 
Lemma 2.29: (Yx) (/I J (x) =3 (Vy) (A\ (x, y) =3 (Bz^A^y^z)]). 

By part (d): (Yx)(A|(x) => (Vy) (3w) \A\ (x, y) => -iAj(y,u ) ]). 

By part (c): (Vx)(Va)(yl}(x) =3 ( Ba)\A\{x , v ) =3 — (t>, */)]). 

By part (d): ( Vx) (Vr) (3w) (A J (x) (A^x, v) =3 -^(u, w))). 

Changing bound variables: (Vx) (Vy) (3z) (A j (x) =3 (A^x^y) =3 -iAj(y,z))). 

2. Let 38 be Aj(x,y) =3 (3y)[,4j(y) => ([(3x)Aj(x)j => A\(y))\. 

By part (b): A\{x,y) =3 (3y)04j(y) => {^u)[A\(u) =3 A\(y)\). 

By part (c): A\{x,y) =3 (3y)(Vr)(^](j’) [A\(v) =>^(y)]). 

By part (d): (3w) (,4 J (x, y ) =y (V/;)[^J(w) => {A\{v) =^^ 2 ( w ))])- 
By part (c): (3w) (Vz) (A \ (x, y) =3 [A[(w) => (A\{z) -Y >4^(w))]). 
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Exercise 

2.84 Find prenex normal forms equivalent to the following wfs. 

(a) [(Vx)(>4 j (x) => Af(x,y))] => ([(3y)i4 j (_>>)] =» (3z)Af(y,z)) 

(b) i (x, y) =>• {A j (x) =*► ->(3w)>4? (x, u ) ) 

A predicate calculus in which there are no function letters or individual 
constants and in which, for any positive integer «, there are infiitely many 
predicate letters with n arguments, will be called a pure predicate calculus. 
For pure predicate calculi we can find a very simple prenex normal form 
theorem. A wf in prenex normal form such that all existential quantifiers (if 
any) precede all universal quantifiers (if any) is said to be in Skolem normal 
form. 


PROPOSITION 2.31 

In a pure predicate calculus, there is an effective procedure assigning to each 
wf 38 another wf S3 in Skolem normal form such that h (ft if and only iff 3f 
(or, equivalently, by Godel’s completeness theorem, such that 38 is logically 
valid if and only if 33 is logically valid). 


Proof 

First we may assume that 38 is a closed wf, since a wf is provable if and only 
if its closure is provable. By Proposition 2.30 we may also assume that 38 is 
in prenex normal form. Let the rank r of 38 be the number of universal 
quantifiers in 33 that precede existential quantifiers. By induction on the 
rank, we shall describe the process for finding Skolem normal forms. 
Clearly, when the rank is 0, we already have the Skolem normal form. Let us 
assume that we can construct Skolem normal forms when the rank is less 
than and let r be the rank of 38. 33 can be written as follows: 
(3yi)...(3j^)(Vi/)^(ji,..., y„,u), where ^(yi t ... t y nt u) has only 
,j>„, u as its free variables. Let A'j +] be the first predicate letter of n F 1 
arguments that does not occur in 33. Construct the wf 


(*i) 


(iyi) . . . (3y„)([(ViO( < <?(>i l . . . ,}’n,u) => A" +l (yi, . . . I/))] 

(Vw)^ +1 (3^, . . . m)) 


Let us show that h 33 if and only if h 33 \ . Assume h 33 \ . In the proof of 38 \ , 
replace all occurrences of A'j l l (zi , ,z n ,w) by 33* (z \ , . . . ,z„, w)), where 33* 
is obtained from 33 by replacing all bound variables having free occurrences 
in the proof by new variables not occurring in the proof. The result is a 
proof of 
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(3yi) ■ • • (3j',,)(((V«)Mti, . ..,y„,u)^«*( yi ,.. .y„, it))) 

=» ( Vi/) r C*(. ,>'„,£/)) 

(#* was used instead of ^ so that applications of axiom (A4) would remain 
applications of the same axiom.) Now, by changing the bound variables 
back again, we see that 

F (3p,) . . . (3y,,)[(Vi/)(#(;>i, . . . ,y„, u) =>• %(y\, . . . ,y„, it)) 

, ^ (V//) < ^(pi, ■ • • iTh) iOJ 

Since I- (Vu) (^(y \ , . . . ,p„, «) => ^(pi, • • • , y n , t/)) , we obtain, by the re- 
placement theorem, F (3p,) . . . (3p„)(Vw)#(pi, . . . ,p„,m), that is, \~ 3ft. 
Conversely, assume that F 3ft. By rule C, we obtain (Vu)#(fei, . . . ,b n ,u). 
But, F (Vi/)^ =y ((Vw)(^ => $) =P (Vu)<f) (see Exercise 2.27 (a)) for any 
wfs ^ and <9. Hence, He (Vh) (#(&,, . . . ,b„, u) =P A’j +l (b\ , . . . , b ln «)) =y 
(Vu)d" +1 (hi, . . . , h„, u). So, by rule E4, F c (3^) . . . (3p„) ([(Vm) (#(£,, 
. ..b„,u) =y i4" +1 (pi, • • • ,P»,u))] => (V«)^4" +1 (pi, . . . ,p„,w)), that is, F c 3ft x . 
By Proposition 2.10, \~ 38\. A prenex normal form of 3ft \ has the form 
^ 2 : ( 3j’i ) . . . (3p„)(3«)((2iZi) • • • (QsZ s ){\/v)^ , where ^ has no quantifiers and 
(Q\,z {) . . . (Q s z s ) is the prefix of c 3. [In deriving the prenex normal form, 
first, by Lemma 2.29(a), we pull out the first (Vt/), which changes to (3//); 
then we pull out of the first conditional the quantifiers in the prefix of ( 3. By 
Lemma 2.29(a, b), this exchanges existential and universal quantifiers, but 
then we again pull these out of the second conditional of 3ft \ , which brings 
the prefix back to its original form. Finally, by Lemma 2.29(c), we bring the 
second ( \fu ) out to the prefix, changing it to a new quantifier (Vr).] Clearly, 
3ft 2 has rank one less than the rank of 38 and, by Proposition 2.30, 
h 38 \ S& 2 - But, h 3ft if and only if F 3ft \ . Hence, F 38 if and only if F 3$i. By 
inductive hypothesis, we can find a Skolem normal form for 3ft 2 , which is 
also a Skolem normal form for 3ft. 


Example 

3ft :(VA-)(Vy)(3z) Y(x,p, z). where Y contains no quantifiers 
^i : (Va)((Vp)(3z)^(a,p, z) =>- Aj (x) ) => (' Mx)Aj (x) , where Aj is not in 9f. 

We obtain the prenex normal form of 3ft 

(Tr)([(Vp)(3z)^(x,p, z) => Aj (x)] => (Vx)Aj (x)) 2.29(a) 

(3x)((3y)[(3z)#(x,p,z) =>Aj(x)] => (Vx)Aj(x)) 2.29(a) 

(Tv) ((3,0 (Vz) [<f (x, p, z) =► Aj (x)] =► (\/x)Aj (x) ) 2.29(b) 

(3x)(Vp)[(Vz)(ff(x,p,z) =Mj(x)) => (Vx)t!-(x)] 2.29(b) 

(3.v)(Vp)(3z)([^(x,p,z) =yAj{x)) => (Vx)Aj(x)) 2.29(a) 

(^)(Vy)(3z)(Vv)[Mx,y,z) => Aj(x)) =» Aj(v)] 2.29(c) 
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We repeat this process again: Let ^(x,y,z, v) be => A}(x)) 

^Aj(v). Let A\ not occur in <2. Form: 

(3x) ( [ (Vy) [ ( 3z) (Vr ) (x, y, z, u) ) =^jt(*,p)]] =► (Vy)Al(x,y)) 

(3x) (3y) [[(3z) (Vu) (2(x,y, z, v)) => A 2 k (x,y)} => (\/y)Al(x,y)] 2.29(a) 

(3x)(3 \y)(3z)(\fv)([2(x,y,z, v) => A 2 k (x,y)] (Vy)A k (x,y)) 2.29(a,b) 

(3.x)(3y)(3z)(Vt;)(Vw)([^( A -,^z, v) => A 2 (x,y)] =► A 2 (x,w)) 2.29(c) 

Thus, a Skolem normal form of AA is: 

(3.Y)(3^)(3z)(Vr)(Vw)([((^(x,^,z) =» A)(x)) =y ^J(r)) =► A 2 k {x,y)\ =>Al(x,w)) 


Exercises 

2.85 Find Skolem normal forms for the following wfs. 

(a) ->(3xM 1 (x) => (V») (3y) (Vx)A 3 (u, x, y) 

(b) (Vx) (3y) (Vm) (3t')Af (x,y, u, v ) 

2.86 Show that there is an effective procedure that gives, for each wf 98 of a 
pure predicate calculus, another wf 2 of this calculus of the form 
(W, ) . . . (Vj„)(3zi) . . . (3z„,)#, such that is quantifier-free, n, m^O, and 98 is 
satisfiable if and only if 2 is satisfiable. [Hint: Apply Proposition 2.3 1 to 

2.87 Find a Skolem normal form 5/ for (Vx)(3)’)/I^(x,>') and show that it is 
not the case that \r ff ! <=y (Vx)(3^)/l|(x, y). Hence, a Skolem normal form for 
a wf 28 is not necessarily logically equivalent to 38 , in contradistinction to the 
prenex normal form given by Proposition 2.30. 


2.11 ISOMORPHISM OF INTERPRETATIONS. 
CATEGORICITY OF THEORIES 


We shall say that an interpretation M of some language is isomorphic 
with an interpretation M* of if and only if there is a one one corres- 
pondence g (called an isomorphism) of the domain D of M with the domain 
D* of M* such that: 

1. For any predicate letter A’j of 99 and for any bi,. . . ,b„ in 
D, (=m A n j[b\ yb„\ if and only if (=m* A”[g(bi), g{b»)\. 

2. For any function letter fj of 99 and for any b\, . . . ,b„ in 
D, 0((/;') M (fci . • • •>*.)) = (fj) M \g(b ,), . . . ,g(b„)). 

3. For any individual constant aj of 99,g((ay) ) = (ay) 

The notation M ~ M* will be used to indicate that M is isomorphic with 
M*. Notice that, if M « M*, then the domains of M and M* must be of 
the same cardinality. 
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PROPOSITION 2.32 


If g is an isomorphism of M with M*, then: 

(a) for any wf 38 of ££, any sequence s — (&i,h 2 , • • •) of elements of 
the domain D of M, and the corresponding sequence g(s) = 
(g(b\), g(b 2 ), . . .), s satisfies 38 in M if and only if g(s) satisfies 38 in M*- 

(b) hence, (=m 38 if and only if |= M * 38. 


Proof 


Part (b) follows directly from part (a). The proof of part (a) is by induction 
on the number of connectives and quantifiers in 38 and is left as an exercise. 

From the definition of isomorphic interpretations and Proposition 2.32 
we see that isomorphic interpretations have the same ‘structure’ and, thus, 
differ in no essential way. 

Exercises 


2.88 Prove that, if M is an interpretation with domain D and D* is a set 
that has the same cardinality as Z), then one can define an interpretation M* 
with domain D* such that M is isomorphic with M*. 

2.89 Prove the following: (a) M is isomorphic with M. (b) If Mi is iso- 
morphic with M 2 , then M 2 is isomorphic with Mi - (c) If Mi is isomorphic 
with M 2 and M 2 is isomorphic with M 3 , then Mi is isomorphic with M 3 . 

A theory with equality K is said to be nt — categorical , where m is a 
cardinal number, if and only if: any two normal models of K of cardinality 
rrt are isomorphic; and K has at least one normal model of cardinality m (see 
Los, 1954c). 

Examples 

1. Let K 2 be the pure theory of equality Ki (see page 98) to which has been 
added axiom (E2): (ELyi )(ELy 2 )(xi 7 ^ x 2 A ( ) (x 3 = xi V *3 =x 2 )). Then 
K 2 is 2-categorical. Every normal model of K 2 has exactly two elements. 
More generally, define (E«) to be: 

(ELvi)...(3x„)| x, 7 ^ Xj A (VyXy =x\ V . . . Vy = x n ) 

\tsS/<ysOi 

where Ai $;</$»( x i A x j is the conjunction of all wfs Xj^Xj with 
1 </ < j^n. Then, if K 7 ' is obtained from Ki by adding (E/?) as an 
axiom, K" is //-categorical, and every normal model of K" has exactly n 
elements. 

2 . The theory K 2 (see page 98) of densely ordered sets with neither first nor 
last element is Ko — categorical (see Kamke, 1950, p. 71: every denumer- 
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able normal model of K 2 is isomorphic with the model consisting of the 
set of rational numbers under their natural ordering). But one can prove 
that K 2 is not m— categorical for any m different from Ho. 

Exercises 

2,90 a Find a theory with equality that is not Ho —categorical but is m— ca- 
tegorical for all m > Ho- [Hint: Consider the theory Gc of abelian groups 
(see page 98). For each integer n, let ny stand for the term (>'+>') + ••• T y 
consisting of the sum of n ys. Add to Gc the axioms (&„): (Vx)piy )(//_>■■ = „v) 
for all «^2. The new theory is the theory of uniquely divisible abelian 
groups. Its normal models are essentially vector spaces over the field of 
rational numbers. Flowever, any two vector spaces over the rational num- 
bers of the same non-denumerable cardinality are isomorphic, and there are 
denumerable vector spaces over the rational numbers that are not iso- 
morphic (see Bourbaki, 1947).] 

2.91 A Find a theory with equality that is m— categorical for all infinite 
cardinals m. [Hint: Add to the theory Gc of abelian groups the axiom 
(Vxi)(2xj = 0). The normal models of this theory are just the vector spaces 
over the field of integers modulo 2. Any two such vector spaces of the same 
cardinality are isomorphic (see Bourbaki, 1947).] 

2.92 Show that the theorems of the theory K" in Example 1 above are 
precisely the set of all wfs of K" that are true in all normal models of 
cardinality n. 

2.93 a Find two non-isomorphic densely ordered sets of cardinality 2 K ° with 
neither first nor last element. (This shows that the theory K 2 of Example 2 is 
not 2 Ko —categorical.) 

Is there a theory with equality that is m— categorical for some non- 
countable cardinal nt but not n— categorical for some other non-countable 
cardinal n? In Example 2 we found a theory that is only Ho-categorical; in 
Exercise 2.90 we found a theory that is m— categorical for all infinite m > Ho 
but not Ho— categorical, and in Exercise 2.91, a theory that is m— categorical 
for all infinite m. The elementary theory G of groups is not nt— categorical 
for any infinite m. The problem is whether these four cases exhaust all the 
possibilities. That this is so was proved by Morley (1965). 

2.12 GENERALIZED FIRST-ORDER THEORIES. 

COMPLETENESS AND DECIDABILITY* 

If, in the definition of the notion of first-order language, we allow a non- 
countable number of predicate letters, function letters, and individual 

'Presupposed in parts of this section is a slender acquaintance with ordinal and 
cardinal numbers (see Chapter 4; or Kamke, 1950; or Sieipinski, 1958). 
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constants, we arrive at the notion of a generalized first-order language. The! 
notions of interpretation and model extend in an obvious way to a gen- 
eralized first-order language. A generalized first-order theory in such a lan- 
guage is obtained by taking as proper axioms any set of wfs of the language. 
Ordinary first-order theories are special cases of generalized first-order 
theories. The reader may easily check that all the results for first-order 
theories, through Lemma 2.12, hold also for generalized first-order theories 
without any changes in the proofs. Lemma 2.13 becomes Lemma 2. 1 3': if the 
set of symbols of a generalized theory K has cardinality then the set of 
expressions of K also can be well-ordered and has cardinality (First, fix a 
well-ordering of the symbols of K. Second, order the expressions by their 
length, which is some positive integer, and then stipulate that if e\ and e 2 are 
two distinct expressions of the same length k , amd j is the first place in which 
they differ, then e\ precedes e 2 if the /th symbol of e } precedes the /th 
symbol of e 2 according to the given well-ordering of the symbols of K.) 
Now, under the same assumption as for Lemma 2.13', Lindenbaum’s 
Lemma 2.14' can be proved for generalized theories much as before, except 
that all the enumerations (of the wfs FSi and of the theories J,-) are transfinite, 
and the proof that J is consistent and complete uses transfinite induction. 
The analogue of Henkin’s Proposition 2.17 runs as follows. 


PROPOSITION 2.33 

If the set of symbols of a consistent generalized theory K has cardinality H a , 
then K has a model of cardinality 


Proof 

The original proof of Lemma 2.15 is modified in the following way. Add 
new individual constants b \ , & 2 , . . . , bx , .... As before, the new theory Ko is 
consistent. Let F\ ) , . . . , F) (*/ ; ) , . . . (2 < co a ) be a sequence consisting of all 
wfs of Ko with exactly one free variable. Let ( Sx ) be the sentence 
{3x h f^Fx(x h ) --T). (£/;), where the sequence b n , b - n , . . . bj , , . . . of distinct 
individual constants is chosen so that b \ j. does not occur in Fp (x f -, ) for (t < 2. 
The new theory K Kj) obtained by adding all the wfs (<S 2 ) as axioms, is proved 
to be consistent by a transfinite induction analogous to the inductive proof 
in Lemma 2. 1 5. Koo is a scapegoat theory that is an extension of K and 
contains R a closed terms. By the extended Lindenbaum Lemma 2.14', Kco 
can be extended to a consistent, complete scapegoat theory J with K* closed 
terms. The same proof as in Lemma 2.16 provides a model M of J of 
cardinality IV 
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COROLLARY 2.34 

(a) If the set of symbols of a consistent generalized theory with equality K 
has cardinality U a , then K has a normal model of cardinality less than 
or equal to U«. 

(b) If, in addition, K has an infinite normal model (or if K has arbitrarily 
large finite normal models), then K has a normal model of any car- 
dinality Up^U a . 

(c) In particular, if K is an ordinary theory with equality (i.e., U a = Uo) and 
K has an infinite normal model (or if K has arbitrarily large finite 
normal models), then K has a normal model of any cardinality 


Proof 

(a) The model guaranteed by Proposition 2.33 can be contracted to a normal 
model consisting of equivalence classes in a set of cardinality U a . Such a set 
of equivalence classes has cardinality less than or equal to U a . 

(b) Assume Up ^U a . Let b\,bx , ... be a set of new individual constants of 
cardinality Up, and add the axioms bx ^ b for X fi. As in the proof of 
Corollary 2.27, this new theory is consistent and so, by (a), has a normal 
model of cardinality less than or equal to Up (since the new theory has Up 
new symbols). But, because of the axioms bx A 6 /( , the normal model has 
exactly Up elements. 

(c) This is a special case of (b). 


Exercise 

2.94 If the set of symbols of a predicate calculus with equality K has 
cardinality U a , prove that there is an extension K' of K (with the same 
symbols as K) such that K' has normal model of cardinality U«, but K/ has 
no normal model of cardinality less than U a . 

From Lemma 2.12 and Corollary 2.34(a, b), it follows easily that, if a 
generalized theory with equality K has U a symbols, is Up-categorical for 
some and has no finite models, then K is complete, in the sense that, 

for any closed wf 98, either h K 98 or Ek ~*98 (Vaught, 1954). If not -Ek 98 and 
not-l~K ~^98, then the theories K 7 = K + {^98} and K ;/ = K + {98} are 
consistent by Lemma 2.12, and so, by Corollary 2.34(a), there are normal 
models M' and M" of K/ and K", respectively, of cardinality less than or 
equal to U«. Since K has no finite models, M 7 and M" are infinite. Hence, by 
Corollary 2.34(b), there are normal models N 7 and N" of K' and K", re- 
spectively, of cardinality Up. By the Up -categoricity of K, N' and N" must be 
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isomorphic. But, since ~>38 is true in N' and 3ft is true in N r/ , this is impossible 
by Proposition 2.32(b). Therefore, either Fk 3ft or F K ^38. 

In particular, if K is an ordinary theory with equality that has no finite 
models and is H^-categorical for some fi^O, then K is complete. As an 
example, consider the theory K 2 of densely ordered sets with neither first nor 
last element (see page 98). K 2 has no finite models and is Ho-cateogrical. 

If an ordinary theory K is axiomatic (i.e., one can effectively decide 
whether any wf is an axiom) and complete, then K is decidable, that is, there 
is an effective procedure to determine whether any given wf is a theorem. To 
see this, remember (see page 86) that if a theory is axiomatic, one can 
effectively enumerate the theorems. Any wf 38 is provable if and only if its 
closure is provable. Hence, we may confine our attention to closed wfs 38. 
Since K is complete, either 3ft is a theorem or ~>38 is a theorem, and, 
therefore, one or the other will eventually turn up in our enumeration of 
theorems. This provides an effective test for theoremhood. Notice, that if K 
is inconsistent, then every wf is a theorem and there is an obvious decision 
procedure; if K is consistent, then not both 3ft and -1 -3ft can show up as 
theorems and we need only wait until one or the other appears. 

If an ordinary axiomatic theory with equality K has no finite models and 
is H/j-categorical for some /3^0, then, by what we have proved, K is de- 
cidable. In particular, the theory K 2 discussed above is decidable. 

In certain cases, there is a more direct method of proving completeness or 
decidability. Let us take as an example the theory IC 2 of densely ordered sets 
with neither first nor last element. Langford (1927) has given the following 
procedure for K 2 . Consider any closed wf -3ft. By proposition 2.30, we can 
assume that 33ft is in prenex normal form {Q\ >’ 1 ) . . . where ft! con- 
tains no quantifiers. If ( Q n y „ ) is (Vy„), replace by ~i(3y„)~i ( i3 . In all 

cases, then, we have, at the right side of the wf, (3y /; )®, where 3ft has no 
quantifiers. Any negation x ^ y can be replaced by x < y V y < x, and 
~>(x < y ) can be replaced by x = y V y < x. Hence, all negation signs can be 
eliminated from 3ft. We can now put 3ft into disjunctive normal form, that is, a 
disjunction of conjunctions of atomic wfs (see Exercise 1.42). Now 
(3 V 382 V ... V 3ft>k) is equivalent to (3y„)33\ V (3y„) 3fti V ... V (3\y n )38y. 
Consider each (3 y n )3fti separately. 3ft> ( is a conjunction of atomic wfs of the 
form t < s and t = s. If 38 j does not contain y„, just erase (3y„). Note that, if 
a wf I does not contain y„, then (3y„)(<f A may be replaced by 
$ A (3y„)#\ Hence, we are reduced to the consideration of ( 3y„ ) 3F , where 
3^ is a conjunction of atomic wfs of the form t < s or t = s, each of which 
contains y„. Now, if one of the conjuncts is y„ = z for some z different from 
y„, then replace in 3ft all occurrences of y„ by z and erase ( 3y„ ) . If we have 
y„ = y„ alone, then just erase (3>’„). If we have y„ — y„ as one conjunct 
among others, then erase y„ = y„. If 3ft has a conjunct y„ < y„, then replace 
all of (3>’„)JA by y„ < y„- If 3ft consists of y n < z„ /\ ... f\ y„ < zj 
f\u\ < y„ A ... A 11 m < y„, then replace (3y„)3ft by the conjunction of all the 
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wfs Mi < z P f° r 1 </w and 1 If all the ns or all the z p s are missing, 

replace (=)>’„ ),^ r by y>n =>’«• This exhausts all possibilities and, in every case, 
we have replaced (3y„)^ by a wf containing no quantifiers, that is, we have 
eliminated the quantifier (3y„). We are left with (giTi) . . . {Q n -\y n -\)^, 
where ^ contains no quantifiers. Now we apply the same procedure suc- 
cessively to {Qn-\y n -\)-, - • • > (Qiyi)- Finally we are left with a wf without 
quantifiers, built up of wfs of the form x = x and x < x. If we replace x = x 
by v = x => x = x and x < x by -i(x = x =>■ x = x), the result is either an 
instance of a tautology or the negation of such an instance. Hence, by 
Proposition 2.1, either the result or its negation is provable. Now, one can 
easily check that all the replacements we have made in this whole reduction 
procedure applied to 'M have been replacements of wfs by other wfs 6 l( 
such that Kk Hence, by the replacement theorem, if our final result 

is provable, then so is the original wf 08 ^ and, if is provable, then so is 
S. Thus, K -2 is complete and decidable. 

The method used in this proof, the successive elimination of existential 
quantifiers, has been applied to other theories. It yields a decision procedure 
(see Hilbert and Bernays, 1934, § 5) for the pure theory of equality Ki (see 
page 98). It has been applied by Tarski (1951) to prove the completeness and 
decidability of elementary algebra (i.e., of the theory of real-closed fields; see 
van der Waerden, 1949) and by Szmielew (1955) to prove the decidability of 
the theory Gc of abelian groups. 


Exercises 

2.95 (Henkin, 1955) If an ordinary theory with equality K is finitely ax- 
iomatizable and ^^-categorical for some a, prove that K is decidable. 

2.96 (a) Prove the decidability of the pure theory Kj of equality. 

(b) Give an example of a theory with equality that is R a -categorical 
for some a, but is incomplete. 


Mathematical applications 

1. Let F be the elementary theory of fields (see page 98). We let n stand for 
the term 1 + 1 -I- . . . + 1, consisting of the sum of n Is. Then the assertion 
that a field has characteristic p can be expressed by the wf %> p : p = 0. A field 
has characteristic 0 if and only if it does not have characteristic p for any 
prime p. Then for any closed wf & of F that is true for all fields of char- 
acteristic 0, there is a prime number q such that 08 is true for all fields of 
characteristic greater than or equal to q. To see this, notice that, if Fo is 
obtained from F by adding as axioms ->^ 2 , • • • , ^ P , • • • (for all primes 

p), the normal models of Fo are the fields of characteristic 0. Hence, by 
Exercise 2.77, Ef 0 but then, for some finite set of new axioms 
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~ >< 3q { , ~'^>q 2 > • • • , ~^>q n » we h ave , ~i^ 2 , . . . , -i^ hp Let g be a 
prime greater than all q n , In every field of characteristic greater than 

or equal to q, the wfs ~A€q x , ~$>q , • • • , are true; hence, :M is also true, 
(Other applications in algebra may be found in A. Robinson (1951) and 
Cherlin (1976).) 

2. A graph may be considered as a set with a symmetric binary relation ft 
(i.e., the relation that holds between two vertices if and only if they are 
connected by an edge). Call a graph ^-colourable if and only if the graph can 
be divided into k disjoint (possibly empty) sets such that no two elements in. 
the same set are in the relation R. (Intuitively, these sets correspond to k 
colours, each colour being painted on the points in the corresponding set, 
with the proviso that two points connected by an edge are painted different 
colours.) Notice that any subgraph of a ^-colourable graph is ^-colourable. 
Now we can show that, if every finite subgraph of a graph ^ is ^-colourable, 
and if *3? can be well-ordered, then the whole graph 0 is ^-colourable. To 
prove this, construct the following generalized theory with equality K (Beth, 
1953). There are two binary predicate letters, Aj(=) and A\ (corresponding 
to the relation R on ^); there are k monadic predicate letters A },..., A\ 
(corresponding to the k subsets into which we hope to divide the graph); and 
there are individual constants a c , one for each element c of the graph As 
proper axioms, in addition to the usual assumptions (A6) and (A7), we have 
the following wfs: 

(I) -iA^(jc,x) (irreflexivity of R) 

(II) ^(w) => A^iy,*) (symmetry of R) 

(III) (Vx) (A } (x) V A\ (x) V . . . V Al_ (x) ) (division into k classes) 

(IV) (Vx)-i(y4j(x) A A- (x) ) , for 1 (disjointness of the k 

■ classes) 

(V) (Vx) (Vj) (Aj (x) A Aj(y) => ~<Aj(x > >’)) for 1 < / < k (two elements of the 
same class are not in the relation R) 

(VI) at / a c , for any two distinct elements b and c of ^ 

(VII) Al(a b ,a c ), if R(b,c) holds in & 

Now, any finite set of these axioms involves only a finite number of the 
individual constants a Ct , . . . ,a Cn , and since the corresponding subgraph 
{ci,...,c„} is, by assumption, ^-colourable, the given finite set of axioms 
has a model and is, therefore, consistent. Since any finite set of axioms is 
consistent, K is consistent. By Corollary 2.34(a), K has a normal model of 
cardinality less than or equal to the cardinality of c §. This model is a k- 
colourable graph and, by (VI)-(VII), has ^ as a subgraph. Hence *3? is also k- 
colourable. (Compare this proof with a standard mathematical proof of the 
same result by Bruijn and Erdos (1951). Generally, use of the method above 
replaces complicated applications of Tychonoffs theorem or Konig’s Un- 
endlichkeits lemma.) 
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Exercises 

2.97 a (Los, 1954b) A group B is said to be orderable if there exists a binary 
relation R on B that totally orders B such that, if xRy, then (x + z)R{}> + z) 
and (z + x)R(z + >’)• Show, by a method similar to that used in Example 2 
above, that a group B is orderable if and only if every finitely generated 
subgroup is orderable (if we assume that the set B can be well-ordered). 
2.98 a Set up a theory for algebraically closed fields of characteristic p(^0) 
by adding to the theory F of fields the new axioms P„, where P„ states that 
every non-constant polynomial of degree n has a root, as well as axioms that 
determine the characteristic. Show that every wf of F that holds for one 
algebraically closed field of characteristic 0 holds for all of them. [Hint: This 
theory is R^-categorical for /? > 0, is axiomatizable, and has no finite 
models. See A. Robinson (1952).] 

2.99 By ordinary mathematical reasoning, solve the finite marriage problem. 
Given a finite set M of m men and a set N of women such that each man 
knows only a finite number of women and, for 1 ^k^m, any subset of M 
having k elements knows at least k women of N (i.e., there are at least k 
women in N who know at least one of the k given men), then it is possible to 
marry (monogamously) all the men of M to women in N so that every man is 
married to a women whom he knows. [Hint (Halmos and Vaughn, 1950): 
m — 1 is trival. For m > l, use induction, considering the cases: (I) for all k 
with K k < m, every set of k men knows at least k + 1 women; and (II) for 
some k with 1 ^k < m, there is a set of k men knowing exactly k women.] 
Extend this result to the infinite case, that is, when M is infinite and well- 
orderable and the assumptions above hold for all finite k. [Hint: Construct 
an appropriate generalized theory with equality, analogous to that in Ex- 
ample 2 above, and use Corollary 2.34(a).] 

2.100 Prove that there is no generalized theory with equality K, having one 
predicate letter < in addition to = , such that the normal models of K are 
exactly those normal interpretations in which the interpretation of < is a 
well-ordering of the domain of the interpretation. 

Let ^ be a wf in prenex normal form. If % is not closed, form its closure 
instead. Suppose, for example, $ is (3>’i ) (Vj^ ) ( V> 3 ) ( 3 j 4 ) ( 3>’s ) ( V>'6 ) 
^(yi )T 2 »p 35 T 4 )T 5 ?T6)> where contains no quantifiers. Erase (3yi) and re- 
place yi in ^ by a new individual constant bp. ( Vy? ) (V>’ 3 ) ( 3v 4 ) (3 j 5 ) ( ) 

Erase (Vy 2 ) and (V> 3 ), obtaining (3>’ 4 )(3>’ 5 )(V>’ 6 ) 
%(bi ,T 2 ,T 3 ,T 4 ,T 5 ,; ; 6 )- Now erase (3y 4 ) and replace^ in by g{yi,yfi}, where 
g is a new function letter: (3>- 5 )(V>’6)^’(^i G^G' 3 , ^(^,>’ 3 ), >’ 5 , >’ 6 )- Erase (3y 5 ) 
and replace >’5 by where h is another new function letter: (Vje) 

^{b\ ,T 2 ,T 3 , 0 (T 2 ,b)^(T 2 ,T 3 ),T 6 )- Finally, erase (V>’ 6 ). The resulting wf 
1 > 2 ,yi, fl 1 J’3 ) 1 A(>’ 2 , J ’3 ) , j’o) contains no quantifiers and will be denoted 
by 0 $*. Thus, by introducing new function letters and individual constants, 
we can eliminate the quantifiers from a wf. 
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Examples 

1. If 88 is (V> 1 ) (=^ 2 ) (Ykj ) (V>’4) ( 3y s ) <8 (>’1 ,)> 2 , >'3 , >' 4 , y s ) , where is quantifier- 
free, then 88* is of the form , <?(>’! ) > >’ 3 , >’ 4 , h (y x , y 3 , y 4 ) ) . 

2. If 88 is ( 3j>i ) ( 3^ ) ( V>’ 3 ) ( Vy 4 ) ( 3y 5 ) ( vi , y 2 , >3 , >’4 , >’ 5 ) , where <8 is quantifier- 
free, then 88* is of the form ^{b , c, yi , jh , g (>’3 , >’ 4 ) ) . 


Notice that 88* F 88, since we can put the quantifiers back by applications 
of Gen and rule E4. (To be more precise, in the process of obtaining 88*, we 
drop all quantifiers and, for each existentially quantified variable y u we 
substitute a term g{z\ , . . . ,z*), where g is a new function letter and zi, . . . , 2 ,. 
are the variables that were universally quantified in the prefix preceding 
(3yy). If there are no such variables zi, . . . ,z*, we replace y,- by a new in- 
dividual constant.) 


PROPOSITION 2.35 (SECOND e-THEOREM) 

(Rasiowa, 1956; Hilbert and Bernays, 1939) Let K be a generalized theory. 
Replace each axiom 88 of K by 88* . (The new function letters and individual 
constants introduced for one axiom are to be different from those in- 
troduced for another axiom.) Let K* be the generalized theory with the 

proper axioms 88*. Then: 

(a) If 88 is a wf of K and Fk* 88, then Fk 88. 

(b) K is consistent if and only if K* is consistent. 

Proof 

(a) Let ^ be a wf of K such that F K - 88. Consider the ordinary theory K° 
whose axioms 88 \ , . . . , are such that 88 \ , . . . , 88* t are the axioms used in 
the proof of 88. Let K°* be the theory whose axioms are 88\, . . . , 88*. Hence 
F k °* 88. Assume that M is a denumerable model of K°. We may assume that 
the domain of M is the -set P of positive integers (see Exercise 2.88). Let 88 be 
any axiom of K°. For example, suppose that 88 has the form 
(3>’i)(Vy2)(V}<’3) (3>’4)^(>’U ; 25>’3,>’4), where is quantifier-free. 88* has the 
form r 6 ’ ( b , >’2 ,yi,g(yi, >’ 3 ) ) • Extend the model M step by step in the following 
way (noting that the domain always remains P); since 88 is true for M, 
( 3y , i ) ( Vy 2 ) ( Vk? ) ( 3y4 ) (yi , yj , 33 ■> y' 4 ) is true for M. Let the interpretation b* of 
b be the least positive integer y\ such that ( Vy 2 ) (V> 3 ) ( 3y 4 ) <8 (y x , y 2 , >’ 3 , ) is 

true for M. Hence, ( ^ ^ ? >’ 2 , JF3 , 3 ^ 4 ) is true in this extended model. For 
any positive integers y 2 and >’3, let the interpretation of g , >' 3 ) be the least 
positive integer >’4 such that ^(£,>’ 2 , >’3, >’4) is true in the extended model. 
Hence, ^(>’ 2 , 33 )) is true in the extended model. If we do this for all 

the axioms 88 of K°, we obtain a model M* of K°*. Since Fk°* 88,88 \% true 
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for M*. Since M* differs from M only in having interpretations of the new 
individual constants and function letters, and since 3 does not contain any 
of those symbols, 3 is true for M. Thus, 3 is true in every denumerable 
model of K°. Hence, I~k c 3, by Corollary 2.20(a). Since the axioms of K° are 
axioms of K, we have I~k 3. (For a constructive proof of an equivalent 
result, see Hilbert and Bernays (1939).) 

b) Clearly, K* is an extension of K, since 28* F 28. Hence, if K* is con- 
sistent, so is K. Conversely, assume K is consistent. Let 3 be any wf of K. If 
K* is inconsistent, F K * 3 A ~>3. By (a), Fk 3 A -> 3, contradicting the 
consistency of K. 


121 


Let us use the term generalized completeness theorem for the proposition 
that every consistent generalized theory has a model. If we assume that every 
set can be well-ordered (or, equivalently, the axiom of choice), then the 
generalized completeness theorem is a consequence of Proposition 2.33. 

By the maximal ideal theorem (MI) we mean the proposition that every 
proper ideal of a Boolean algebra can be extended to a maximal ideal.t This 
is equivalent to the Boolean representation theorem, which states that every 
Boolean algebra is isomorphic to a Boolean algebra of sets. (Compare Stone 
(1936). For the theory of Boolean algebras, see Sikorski (1960) or Men- 
delson (1970).) The usual proofs of the MI theorem use the axiom of choice, 
but it is a remarkable fact that the MI theorem is equivalent to the gen- 
eralized completeness theorem, and this equivalence can be proved without 
using the axiom of choice. 


PROPOSITION 2.36 

(Los, 1954a; Rasiowa and Sikorski, 1951; 1952) The generalized complete- 
ness theorem is equivalent to the maximal ideal theorem. 


Proof 

(a) Assume the generalized completeness theorem. Let B be a Boolean al- 
gebra. Construct a generalized theory with equality K having the binary 
function letters U and fl, the singulary function letter // [we denote Jf (/) by 
7], predicate letters = and A\, and, for each element b in B, an individual 
constant at- By the complete description of B, we mean the following 
sentences: (i) at ^ a c if b and c are distinct elements of B; (ii) at U a c — aj if 
b, c, d are elements of B such that b U c = d in B; (iii) at fl a c = a e if b , c, e 
are elements of b such that b n c = e in B; and (iv) at = a c if b and c are 
elements of B such that b = c in B, where b denotes the complement of b. As 

1 Since {0} is a proper ideal of a Boolean algebra, this implies (and is implied by) 
the proposition that every Boolean algebra has a maximal ideal. 
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axioms of K we take a set of axioms for a Boolean algebra, axioms (A6) and 
(A7) for equality, the complete description of B, and axioms asserting that 
A\ determines a maximal ideal (i.e., ^}(xnx), yf}(x) f\A\(y) =>^4}(xU};) 
^}(x) ^}(x) Vy4}(x), and ^{(jtlli)). Now K is consistent, for, 

if there were a proof in K of a contradiction, this proof would contain only 
a finite number of the symbols a/,, a c , . . . — say, ai n , . . . , cii n . The elements 
b\ , . . . , b„ generate a finite subalgebra B' of B. Every finite Boolean algebra 
clearly has a maximal ideal. Hence, B' is a model for the wfs that occur in 
the proof of the contradiction, and therefore the contradiction is true in B', 
which is impossible. Thus, K is consistent and, by the generalized com- 
pleteness theorem, K has a model. That model can be contracted to a 
normal model of K, which is a Boolean algebra A with a maximal ideal 1. 
Since the complete description of B is included in the axioms of K, B is a 
subalgebra of A, and then I P| B is a maximal ideal in B. 

(b) Assume the maximal ideal theorem. Let K be a consistent generalized 
theory. For each axiom (A of K, form the wf PA* obtained by constructing a 
prenex normal form for PA and then eliminating the quantifiers through the 
addition of new individual constants and function letters (see the example 
preceding the proof of Proposition 2.35). Let K # be a new theory having the 
wfs PA*, plus all instances of tautologies, as its axioms, such that its wfs 
contain no quantifiers and its rules of inference are modus ponens and a rule 
of substitution for variables (namely, substitution of terms for variables). 
Now, K # is consistent, since the theorems of K# are also theorems of the 
consistent K* of Proposition 2.35. Let B be the Lindenbaum algebra de- 
termined by Kf (i.e., for any wfs and S>, let ^ Eq PA mean that 
h K # C A Eq is an equivalence relation; let [7f] be the equivalence class of 
7f; define [7f] U [£] = [7f V PA ] , [7f] n [£2>] = A £$], [ ( P] = f-i#]; under these 
operations, the set of equivalence classes is a Boolean algebra, called the 
Lindenbaum algebra of K # ). By the maximal ideal theorem, let I be a 
maximal ideal in B. Define a model M of having the set of terms of K# 
as its domain; the individual constants and function letters are their own 
interpretations, and, for any predicate letter A’J, we say that A'Ut\ is 

true in M if and only if [A'j (t\ , . . . , t /; )] is not in I. One can show easily that a 
wf ^ of K# is true in M if and only if is not in I. But, for any theorem PA 
of K#, [PA] = l, which is not in I. Hence, M is a model for K # . For any 
axiom PA of K, every substitution instance of PA*(y\ , . . . ,y n ) is a theorem in 
K#; therefore, PA* (y \ , • . • , y „ ) is true for all yi> ■ ■ ■ , y„ in the model. It follows 
easily, by reversing the process through which 08* arose from PS, that PA is 
true in the model. Hence, M is a model for K. 


The maximal ideal theorem (and, therefore, also the generalized com- 
pleteness theorem) turns out to be strictly weaker than the axiom of choice 
(see Halpern, 1964). 
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2.101 Show that the generalized completeness theorem implies that every 
set can be totally ordered (and, therefore, that the axiom of choice holds for 
any set of non-empty disjoint finite sets). 

The natural algebraic structures corresponding to the propositional cal- 
culus are Boolean algebras (see Exercise 1.60, and Rosenbloom, 1950, 
chaps 1 and 2). For first-order theories, the presence of quantifiers in- 
troduces more algebraic structure. For example, if K is a first-order theory, 
then, in the corresponding Lindenbaum algebra B, [(3x)^(x)] = L t [^(t)\, 
where L, indicates the least upper bound in B, and t ranges over all terms of 
K that are free for x in &(x). Two types of algebraic structure have been 
proposed to serve as algebraic counterparts of quantification theory. The 
first, cylindrical algebras, have been studied extensively by Tarski, 
Thompson, Henkin, Monk and others (see Henldn, Monk and Tarski, 
1971). The other approach is the theory of polyadic algebras, invented and 
developed by Halmos (1962). 


2.13 ELEMENTARY EQUIVALENCE. ELEMENTARY EXTENSIONS 

Two interpretations Mi and M 2 of a generalized first-order language J5f are 
said to be elementarily equivalent (written Mi = M 2 ) if the sentences of J 2 ; 
true for Mi are the same as the sentences true for M 2 . Intuitively, Mi = M 2 
if and only if Mi and M 2 cannot be distinguished by means of the language 
Y 7 . Of course, since J5? is a generalized first-order language, may have 
non-denumerably many symbols. 

Clearly, (1) M = M; (2) if Mi = M 2 , then M 2 = Mi; (3) if Mi = M 2 and 
M 2 s M 3 , then Mi = M 3 . 

Two models of a complete theory K must be elementarily equivalent, 
since the sentences true in these models are precisely the sentences provable 
in K. This applies, for example, to any two densely ordered sets without first 
or last elements (see page 116). 

We already know, by Proposition 2.32(b), that isomorphic models are 
elementarily equivalent. The converse, however, is not true. Consider, for 
example, any complete theory K that has an infinite normal model. By 
Corollary 2.34(b), K has normal models of any infinite cardinality R*. If we 
take two normal models of K of different cardinality, they are elementarily 
equivalent but not isomorphic. A concrete example is the complete theory 
K 2 of densely ordered sets that have neither first nor last element. The 
rational numbers and the real numbers, under their natural orderings, are 
elementarily equivalent non-isomorphic models of K 2 . 
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Exercises 

2.102 Let Kao, the theory of infinite sets, consist of the pure theory Kj of 
equality plus the axioms where asserts that there are at least « 
elements. Show that any two models of Koo are elementarily equivalent (see 
Exercises 2.66 and 2.96(a)). 

2.103 d If Mi and M 2 are elementarily equivalent normal models and Mi is 
finite, prove that Mi and M 2 are isomorphic. 

2.104 Let K be a theory with equality having K a symbols. 

(a) Prove that there are at most 2 Ka models of K, no two of which are 
elementarily equivalent. 

(b) Prove that there are at most 2*9 mutually non-isomorphic models of K 
of cardinality fty, where y is the maximum of a and /?. 

2.105 Let M be any infinite normal model of a theory with equality K 
having R* symbols. Prove that, for any cardinal R, ^ R a , there is a normal 
model M* of K of cardinality R a such that M = M*. 

A model M 2 of a language is said to be an extension of a model Mi of 
££ (written Mi C M 2 )^ if the following conditions hold: 

1 . The domain D\ of Mi is a subset of the domain D 2 of M 2 . 

2. For any individual constant c of J5?, c Mz = c M| , where and c Ml are 
the interpretations of c in M 2 and Mi. 

3. For any function letter f? of ££ and any b\,...,b n in D, 

4. For any predicate letter A'j of ££ and any b \ , . . . , b n in D, 

\= M\A n j[b \, . . . , b„] if and only if ^=m 2 tf[b 1 , . . . , b n \. 

When Mi C M 2 , one also says that M 3 is a substructure (or submodel ) of M 2 . 


Examples 

1. If 5? contains only the predicate letters = and <, then the set of rational 
numbers under its natural ordering is an extension of the set of integers 
under its natural ordering. 

2. If & is the language of field theory (with the predicate letter =, function 
letters + and x, and individual constants 0 and l), then the field of real 
numbers is an extension of the field of rational numbers, the field of 
rational numbers is an extension of the ring of integers, and the ring of 
integers is an extension of the ‘semiring’ of non-negative integers. For 
any fields Fi and F 2 , Fi C F 2 if and only if Fi is a subfield of F 2 in the 
usual algebraic sense. 


^The reader will have no occasion to confuse this use of C with that for the 
inclusion relation. 
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Exercises 

;|,106 Prove: 

(a) M C M; 

v0) if Mi C M 2 and M 2 C M 3 , then Mi C M 3 ; 

( C ) If Mi C M 2 and M 2 C Mi, then Mi = M 2 . 

12.107 Assume Mi C M 2 - 

(a) Let P/i{x \ ,...,*„) be a wf of the form (Vgi) . . . (Vy,,,)#^, . . .x„, 

yi . .. ,y m ), where # is quantifier-free. Show that, for any in 

the domain of Mi, if f=M 2 8ft\b\, . • • ,b n ], then [=m, &[bi, . . . , b„\. In 
particular, any sentence (Vj-'i ) . . . (V y^f8{y\ ■ ■ ■ , y,» ) , where # is quan- 
tifier-free, is true in Mi if it is true in M 2 . 

(b) Let 88(xi, . . . ,x„) be a wf of the form (B^i ) . . . (3y m )^ (xi 

yi , . . . ,y m ), where ^ is quantifier-free. Show that, for any bi,...,b„ in 
the domain of Mi, if [=m, 8$\b \ , . . . , 6 „],then [=m 2 $\b \ , . . . , £„]. In 
particular, any sentence (Ejyi) . . . ( 3y m ) ^{y\ . . . , y ,„ ) , where %> is quan- 
tifier-free, is true in M 2 if it is true in Mi . 

2.108 (a) Let K be the predicate calculus of the language of field theory. 

Find a model M of K and a non-empty subset X of the domain 
D of M such that there is no substructure of M having domain 

(b) If K is a predicate calculus with no individual constants or 
function letters, show that, if M is a model of K and X is a subset 
of the domain D of M, then there is one and only one sub- 
structure of M having domain X. 

(c) Let K be any predicate calculus. Let M be any model of K and 
let X be any subset of the domain D of M. Let Y be the inter- 
section of the domains of all submodels M* of M such that A is a 
subset of the domain Dm* of M* . Show that there is one and only 
one submodel of M having domain Y. (This submodel is called 
the submodel generated by X.) 


A somewhat stronger relation between interpretations than ‘extension’ is 
useful in model theory. Let Mi and M 2 be models of some language Jzf . We 
say that M 2 is an elementary extension of Mi (written Mi < e M 2 ) if 
(1)M, C M 2 and (2) for any wf 8d{y\ ... ,y„) of $£ and for any b\ , . . . , b„ in 
the domain D\ of Mj , f=M, 88\b \ , . . . , b„ } if and only if [=m 2 8d[b \ , . . . , b„]. (In 
particular, for any sentence 83 of 8ft is true for Mi if and only if 8ft is true 
for M 2 .) When Mi ^ e M 2 , we shall also say that Mi is an elementary sub- 
structure (or elementary submodel) of M 2 . 

It is obvious that, if Mi < e M 2 , then Mj C M 2 and Mi = M 2 . The con- 
verse is not true, as the following example shows. Let G be the elementary 
theory of groups (see page 98). G has the predicate letter =, 
function letter +, and individual constant 0. Let I be the group of integers 
and E the group of even integers. Then DC/ and I = E. (The function g 
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such that g(x) = 2x for all x in I is an isomorphism of I with E.) Consider the 
wf y$(y): (3x)(x + x — y). Then (=/ &\2], but not-f= £ 38[2], Thus, / is not an 
elementary extension of E. (This example shows the stronger result that even 
assuming Mi C M2 and Mi — M2 does not imply Mi ^eM 2 .) 

The following theorem provides an easy method for showing that 

Mi < e M 2 . 


PROPOSITION 2.37 (Tarski and Vaught, 1957) 

Let Mi C M 2 , Assume the following condition: 

($) For every wf $(x\ , . . . x k ) of the form (3y)^(x\ , . . .x k ,y) and for all 
b\,,..,b k in the domain D\ of Mi, if (=m 2 J’f&i, . . . , b k ], 
then there is some d in D\ such that (=m 2 ^[b\ , . . . ,b k ,d]. 

Then Mi ^ e M 2 . 


Proof 

Let us prove: 

(*) S>[b\ - . - • , b k J if and only if (=m, 3[b\ ,... ,b k ] for any wf 

3){x\ , . . . , Xi ) and any b\ , . . . , b k in £>1 . 

The proof is by induction on the number m of connectives and quantifiers in 
£). If m — 0, then (*) follows from clause 4 of the definition of Mi CM 2 . 
Now assume that (*) holds true for all wfs having fewer than m connectives 
and quantifiers. 

Case 1. Q) is -><f. By inductive hypothesis, f=M, <f[&i, . . . , b k \ if and only if 
J=m 2 &[bi , . . • , h*]. Using the fact that not - f=M, &[b \ , . . . , b k ] if and only if 
J=m, -«f[6i, . . . , b k ] , and similarly for M 2 , we obtain (*). 

Case 2. 3 is £ 3F. By inductive hypothesis, (= Ml £\b \ , - . - , b k ] if and 

only if )=m 2 <?[bi , . . - , b k \ and similarly for 2F . (*) then follows easily. 

Case 3. <3 is (3y)<f (xi, ...x„,y). By inductive hypothesis, 

(**) Hm, $[b \ , - . . , b kt d] if and only if |= M2 S[bi ,...,b k ,d], 
for any b\ , . . . , b k , d in D\ . 

Case 3a. Assume f=M, (3y)S , (x 1 , . . . x k , y) [b \ , . . . , b k ] for some b \ , . . . , b k in 
£> 1 . Then ^=m, • • • ,b k ,d\ for some d in D\. So, by (**), 

|=m 2 S'[bi,. ..,b k ,d\. Hence, (=m 2 {3y)S{x\ 

J * • * y)[b u . . . , b k ]. 

Case 3b. Assume |=m 2 , . . .x k ,y)\b \ , b k ] for some bi,. .. ,b k in 

D\. By assumption ($), there exists d in D\ such that (=m 2 S[b\ , . . .,b k ,d\. 
Hence, by (**), f=M, £> [&i , • • • , b kl d\ and therefore |=m, {3y)£{x\, . . .x k ,y) 
, • . - , b . 
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This completes the induction proof, since any wf is logically equivalent to 
a wf that can be built up from atomic wfs by forming negations, condi- 
tionals and existential quantifications. 


Exercises 

2.109 Prove: 

(a) IVKcM; 

(b) if Mi < e M 2 and M 2 ^ e M 3 , then Mi =^ e M 3 ; 

(c) if Mi ^ e M and M 2 ^ e M and Mi C M 2 , then Mi ^ e M 2 . 

2.110 Let K be the theory of totally ordered sets with equality (axioms 
(a) ( c ) and (e) -(g) of Exercise 2.67). Let Mi and M 2 be the models for K 
with domains the set of positive integer's and the set of non-negative integers, 
respectively (under their natural orderings in both cases). Prove that 
Mi C M 2 and Mi ~ M 2 , but Mi ^ e M 2 . 

Let M be an interpretation of a language . Extend to a language $£* 
by adding a new individual constant a d for every member d of the domain of 
M. We can extend M to an interpretation of Sd* by taking d as the inter- 
pretation of a d . By the diagram of M we mean the set of all true sentences of 
M of the forms A’l (a dl ,...,a dn ), -i A*. (a di ,...,a dn ), and f* (a dx , . . . , a dn ) = a dm . 
In particular, a d[ ^ a d? belongs to the diagram if d\ / <7 2 - By the complete 
diagram of M we mean the set of all sentences of that are true for M . 

Clearly, any model of the complete diagram of M determines an 
elementary extension of M,* and vice versa. 


Exercise 

2.111 (a) Let Mi be a denumerable normal model of an ordinary theory K 

with equality such that every element of the domain of Mi is the 
interpretation of some closed term of K. 

(i) Show that, if Mi C M 2 and Mi = M 2 , then Mj =^ e M 2 . 

(ii) Prove that there is a denumerable normal elementary ex- 
tension M 3 of Mi such that Mi and M 3 are not isomorphic. 

(b) Let K be a predicate calculus with equality having two function 
letters + and x and two individual constants 0 and 1 . Let M be 
the standard model of arithmetic with domain the set of natural 
numbers, and +, x,0 and l having their ordinary meaning. 
Prove that M has a denumerable normal elementary extension 
that is not isomorphic to M, that is, there is a denumerable 
nonstandard model of arithmetic. 

Nhe elementary extension M # # of M is obtained from M# by forgetting about 
the interpretations of the a d s. 
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PROPOSITION 2.38 (UPWARD SKOLEM-LOWENHEIM-TARSKI 
THEOREM) 

Let K be a theory with equality having R a symbols, and let M be a normal 
model of K with domain of cardinality Let y be the maximum of a and (i 

Then, for any <5 ^ y, there is a model M* of cardinality R^ such that M ^ 
and M ^ e M*. 


Proof 

Add to the complete diagram of M a set of cardinality R <5 of new individual 
constants b T , together with axioms b x yk b p for distinct t and p and axioms 
b x yk a ( j for all individual constants a ( i corresponding to members d of the 
domain of M. This new theory K # is consistent, since M can be used as a 
model for any finite number of axioms of K # . (If b Xl , . . . , b Xk , , . . . , aj m are 

the new individual constants in these axioms, interpret b X] , . . . , b Xk as distinct 
elements of the domain of M different from d \ , . . . , d m . ) Hence, by Corollary 
2.34 (a), has a normal model of cardinality R<? such that M C IVF, 
M ^4 M#, and M 


PROPOSITION 2.39 (DOWNWARD SKOLEM-LOWENHEIM- 
TARSKI THEOREM) 

Let K be a theory having R a symbols, and let M be a model of K with 
domain of cardinality ft, ^ R a . Assume A is a subset of the domain D of M 
having cardinality n, and assume is such that R v max(R a , n). Then 

there is an elementary submodel M* of M of cardinality R/? and with domain 
D* including A. 


Proof 

Since n^R/j^Ry, we can add R^ elements of D to A to obtain a larger set B 
of cardinality R/?. Consider any subset C of D having cardinality Wp. For 
every wf &(y \, . • . ,y,i,z) of IC, and any in C such that 

|=m . . ■ , , z) [ci , . . . , c„] , add to C the first element d of D (with 

respect to some fixed well-ordering of D) such that |=m (3z)^[c ]? . . . ,c„,d\. 
Denote the so-enlarged set by C#. Since IC has R a symbols, there are R a wfs. 
Since R a ^R/j, there are at most Rp new elements in C# and, therefore, the 
cardinality of C& is R^. Form by induction a sequence of sets Co, Cj, . . . by 
setting Co = B and C n+ \ =- Cf. Let IT ~ U„ ew C„. Then the cardinality of IT 
is Kp. In addition, IT is closed under ail the functions (Assume 

d \ , . . . , d„ in IT . We may assume d \ , . . . , d„ in Q- for some k. Now 
f=M • • , X B ) = z)[d \,. . . , d n ]. Hence, (J?) u {d\,. . . ,d„), being the 
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first and only member dofD such that f= M (JJ(x \ , . . . , x „ ) = z)[d \, . . . , d„, d\, 
,tnu st belong to cf = C k+ i CD*.) Similarly, all interpretations (^-) M of in- 
dividual constants are in D*. Hence, D* determines a substructure M* of M. 
To show that M* < e M, consider any wf <%{y\ ,... ,y n ,z) and any d \ , . . . , d„ in 
D* such that [=m (=br)^(yi , . . . ,y„,z)[di,. . . ,d„]. There exists C k such that 
A .... ,d„ are in C k . Let d be the first element of D such that 
f=M @[du- Then d e cf = C ky \ CD*. So, by the Tarski-Vaught 

theorem (Proposition 2.37) M*^ e M. 
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2,14 ULTRAPOWERS. NON-STANDARD ANALYSIS 

By a filter 1 on a non-empty set A we mean a set :F of subsets of A such that: 

1. A e F 

2. B e CF ACe.f ^BnCe.f 

3. Be^ABCC^Ce^ 

Examples 

Let B C A. The set Fb — {C \ B C C C A} is a filter on A. CF b consists of all 
subsets of A that include B. Any filter of the form CF B is called a principal 
filter. In particular, CF A = {A} and = CP {A) are principal filters. The filter 
£P(A) is said to be improper and every other filter is said to be proper. 


Exercises 

2.112 Show that a filter CF on A is proper if and only if 0 ^ CF . 

2.113 Show that a filter CF on A is a principal filter if and only if the 
intersection of all sets in is a member of CF . 

2.114 Prove that every finite filter is a principal filter. In particular, any 
filter on a finite set A is a principal filter. 

2.115 Let A be infinite and let CF be the set of all subsets of A that are 
complements of finite sets: CF = {C\(3W)(C — A — W A Fin(JT)}, where 
Fin(lT) means that W is finite. Show that CF is a non-principal filter on A. 

2.116 Assume A has cardinality fyj. Let Let CF be the set of all 

subsets of A whose complements have cardinality < P a . Show that CF is a 
non-principal filter on A. 

2.117 A collection ^ of sets is said to have the finite intersection property if 
B\ fl n . . . fl B k ^ 0 for any sets B \ , 5 2 , . . . , B k in If ^ is a collection of 


tThe notion of a filter is related to that of an ideal. A subset F of CP (A) is a filter 
on A if and only if the set ^ — {A — B \ B 6 F} of complements of sets in F is an 
ideal in the Boolean algebra CP{A). Remember that 'CP {A) denotes the set of all subsets 
of A. 
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subsets of A having the finite intersection property and FA is the set of all 
finite intersections B\ n Bj D . . . n2?* of sets in A show that 
AF = {Z)|(3C)(B E XF AC CD C A)} is a proper filter on A. 


DEFINITION 


A filter AF on a set A is called an ultrafilter on A if AF is a maximal proper 
filter on A, that is, 3F is a proper filter on A and there is no proper filter <3 on 
A such that AF c 

Example 

Let d E A. The principal filter AF $ - {B\d E B AB C A} is an ultrafilter on A. 
Assume that ^ is a filter on A such that AF d c A Let C E — AF Then 
CCA and d C. Hence, d E A — C. Thus, A — C E AF d c A Since ^ is a 
filter and C and A — C are both in then 0 = C Cl (A — C) E Hence, ^ is 
not a proper filter. 

Exercises 

2.118 Let IF be a proper filter on A and assume that B C A and A — B £ AF . 
Prove that there is a proper filter .AF' A) AF such that B E AF ' . 

2.119 Let AF be a proper filter on A. Prove that AF is an ultrafilter on A if 
and only if, for every B C A, either B E AF or A — B E AF . 

2.120 Let AF be a proper filter on A Show, that AF is an ultrafilter on A if and 
only if, for all B and C in AA(A), if B £ AF and C £ AF, then BU C £ AF. 

2.121 (a) Show that every principal ultrafilter on A is of the form 

AF j = {B\d E B AB C A} for some d in A. 

(b) Show that a non-principal ultrafilter on A contains no finite sets. 

2.122 Let be a filter on A and let be the corresponding ideal: B E F if 
and only if A — B E AF . Prove that AF is an ultrafilter on A if and only if X is 
a maximal ideal. 

2.123 Let X be a chain of proper filters on A , that is, for any B and C in X, 
eithr B C C or C C B. Prove that the union |JA" = {tf|(3Z?)(£ EX Aa E 2?)} 
is a proper filter on A, and B C (JAT for all B in X. 


PROPOSITION 2.40 (ULTRAFILTER THEOREM) 

Every proper filter AF on a set A can be extended to an ultrafilter on A^ 


t\Ve assume the generalized completeness theorem. 
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proof 

Let ^ be a proper filter on A. Let J be the corresponding proper ideal: 
p g J if and only if A — B E & ■ By Proposition 2.36, every ideal can be 
extended to a maximal ideal. In particular, J> can be extended to a maximal 
ideal JY . If we let % = {B\A —Be }, then <?/ is easily seen to be an ul- 
trafilter and C <?/. 

Alternatively, the existence of an ultrafilter including 3F can be proved 
easily on the basis of Zorn’s lemma. (In fact, consider the set A of all proper 
filters fF' such that C # r/ . A is partially ordered by C, and any c-chain in 
A has an upper bound in A, namely, by Exercise 2.123, the union of all 
filters in the chain. Hence, by Zorn’s lemma, there is a maximal element .A* 
in A, which is the required ultrafilter.) However, Zorn’s lemma is equivalent 
to the axiom of choice, which is a stronger assumption than the generalized 
completeness theorem. 


COROLLARY 2.41 

If A is an infinite set, there exists a non-principal ultrafilter on A. 


Proof 

Let 2F be the filter on A consisting of all complements A— B of finite 
subsets B of A (see Exercise 2.115). By Proposition 2.40, there is an ul- 
trafilter 6 tl D JA Assume is a principal ultrafilter. By Exercise 2.121(a), 
% = £F j for some d E A. Then A — (4 e ^ C f , Also, {d} E tfl. Hence, 
0 = (4 n (A — {d}) € fy, contradicting the fact that an ultrafilter is 
proper. 


Reduced direct products 

We shall now study an important way of constructing models. Let K be any 
predicate calculus with equality. Let J be a non-empty set and, for each j in 
J, let M, be some normal model of K. In other words, consider a function F 
assigning to each j in J some normal model. We denote F(j) by My. 

Let 3A be a filter on J . For each j in J, let Dj denote the domain of the 
model Mj. By the Cartesian product Ylj eJ Dj we mean the set of all functions 
/ with domain J such that f(j) C_ Dj for all j in J. If / 6 Ylj^jDj, we shall 
refer to f(J) as the /th component of /. Let us define a binary relation in 
n jeJ D j as follows: 

/ =#■ G if and only if {j\f(J) = g(j)} e & 

If we think of the sets in 2F as being ‘large’ sets, then, borrowing a phrase 
from measure theory, we read f =& g as \f (j) = g(j ) almost everywhere’. 
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It is easy to see that is an equivalence relation: (1) / =jr /; (2) if 
/ =& g then g =jr /; (3) if / g and g =& h, then f =& h. For the proof 

of (3), observe that = g{J)} n {j\g{j) = h{J)} C = h{J)}. If 

{/l/C/) = 0(/)} and {/1 0 (/) = K /')} are in F, then so is their intersection 

and, therefore, also {/(/(/) = *(/)}- 

On the basis of the equivalence relation we can divide into 

equivalence classes: for any / in Yl je jDj, we define its equivalence class fo ? as 
{g\f g}- Clearly, (1) / e fe\ (2) /> = h ? if and only if / =& h; and (3) 
if /jF / then .// fl We denote the set of equivalence classes 

by XljfzjDj/'W. Intuitively, Ylj^jDj/F is obtained from Tl je jDj by identifying 
(or merging) elements of II jsjDj that are equal almost everywhere. 

Now we shall define a model M of K with domain Wj^jDj/F. 


1. Let c be any individual constant of K and let Cj be the interpretation of c 
in My. Then the interpretation of c in M will be /^, where / is the 
function such that f(j) ~ Cj for all j in J . We denote / by {cy} 6j . 

2. Let f k be any function letter of K and let A k be any predicate letter of K. 


Their interpretations (y/) M and {A k y i are defined in the following 
manner. Let (gi)^r, ■ - . , be any members of Xl jG jDj/F. 

(fk) M ((g Ojr, •••, (fti)jr) = h*, where h (j) = {f k ) Mj (g\ (/), . . . , for 
all y in J. 

. . . , ( 0 ,,)^) holds if and only if 
{/'I \=Mj A n k [g\{j), . . . , g„ C/)] ) e F. 

Intuitively, (y/) M is calculated componentwise, and {A k ) u holds if and only 
if A k holds in almost all components. Definitions (a) and (b) have to be 
shown to be independent of the choice of the representatives g \ , . . . , g n in 
the equivalence classes ( 01 )^, . . . , ( g„ )&: if g\ —&■ gl ,. . . , g„ =&■ g * and 

»V) = «?r ww. .... a m, then ©t =, and Oi) oi i=M y 4m), 
••• , On GO] } e&if and only if (yj f= M/ A n k [g\{j ), . . . , g*(j)]} e F. 

Part (i) follows from the inclusion 


M 


(a) 

(b) 


(/lf/i (/) = fftC/)} n . . ■ n OlftiO) = <(/)} £ 

oiw') H (0i o), • • • . ».(/■)> - w) M 'fo:o), ■ • • , g:um 


Part (ii) follows from the inclusions: 

VlGiU) = 9iU)} n • • • n VlffnU) = Sn (J ) } Q 
{/I Hm, A'i[gi(j), ■ ■ ■ ,g»(j )\ if and onl y if Hm, A k[dl(j), ■ ■ 


and 


{/I Hm, A" k \g\{J),..., g„(j)]} n {/I Hm,- A l\g { g„(j)\ if and 

only if |=m, A n k [g\{j ), . . . , g * (/)] C (y| |= My A n k [g\{j), . . . , g*(j)]} 


In the case of the equality relation = , which is an abbreviation for A \ , 




ULTRAPOWERS. NON-STANDARD ANALYSIS 


(A if and only if {/j |=m, A\ffU)M)\} e & 

if and only if {j\g(j) = /*(/)} e & 
if and only if g =& h 

that is, if and only if gg? = h$r. Hence, the interpretation (^) M is the 
identity relation and the model M is normal. 

The model M just defined will be denoted n /& /M y -/#' and will be called a 
reduced direct product. When $F is an ultrafilter, II/ejMy/J* is called an 
ultraproduct. When <F is an ultrafilter and all the M y s are the same model N, 
then ll/ejMy / & is denoted N J / IF and is called an ultrapower. 
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Examples 

1. Choose a fixed element r of the index set J, and let $F be the principal 

ultrafilter EF r — {B\r E B A B C J}. Then, for any/, g in Ylj eJ Dj,f =jr g if 
and only if = g{j)} £ that is, if and only if /(r) = g(r). Hence, 

a member of n ye /D y - f3F consists of all / in n jejDj that have the same rth 
component. For any predicate letter A n k of K and any g \ , . . . , g„ in n ye /D y , 

f=M i • • * ) tf and only if (y| J= (y) , * • • j OniSiW £ A' , 

that is, if and only if f=M r A" k [g\ (y), . . . , g„ (y)] . Hence, it is easy to verify 
that the function (p : Yl /( zjDj /PA — ► /),., defined by (p(g^) = g(r) is an 
isomorphism of n ye jMy/^ with M,-. Thus, when EF is a principal ultra- 
filter, the ultraproduct HygjMy/#' is essentially the same as one of its 
components and yields nothing new. 

2. Let EF be the filter {J}. Then, for any / g in Uj e jDj,f =yr g if and only if 
WU) = &U)} £ $F, that is, if and only if /(y) = g(j) for all j in J, or if 
and only if f = g. Thus, every member of Yl je jDj [PA is a singleton {g} 
for some g in TJ JeJ Dj. Moreover, (j f k ) M {(g\)^, - - . , (g n )&) = {^}, where g 
is such that g(j) = (ft) Mj (gi(j ), • • • , g„(j)) for al l j in J - Aiso > 
Hm ^*[(0iXr> • • . , {g n )sA if and only if |= My A n k [gy{j ), . . . , g„(j)\ for all j in 
J. Hence, n y6 jM y /^ is, in this case, essentially the same as the ordinary 
‘direct product’ n yG jM y , in which the operations and relations are de- 
fined componentwise. 

3. Let .‘F be the improper filter A/(J). Then, for any /, g in Ylj e jDj, f =&■ g if 

and only if = g(j)} £ that is, if and only if (y|/(y) = 

g(J)} E &(J). Thus, / =jr g for all / and g, and n ye jDy/J r consists of 
only one element. For any predicate letter A”., [=m A k [// , . . . , fj r] if and 
only if (y (=m 7 A If Oh ■ • - ,/(/)])} E EA{J\, that is, every atomic wf is true. 

The basic theorem on ultraproducts is due to Los (1955b). 


PROPOSITION 2.42 (LOS’S THEOREM) 

Let FF be an ultrafilter on a set J and let M = n ye /M y - fEF be an ultraproduct. 
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(a) Let 5 = {{cj\ ) i!F , { 02 ) 37 , . • -) be a denumerable sequence of elements of 
Y\j<rjDj j 3. For each j in J, let sy be the denumerable sequence 
(#1(7)5 #2(7)5 • • •) i n Dj . Then, for any wf 3 of K, 5 satisfies 3 in M if 
and only if {y'|sy satisfies 3 in My} G 3 . 

(b) For any sentence 3 of K, 3 is true in fIy e jMy/J r if and only if 
{/| |=m ; $} C 3. (Thus, (b) asserts that a sentence 3 is true in an 
ultraproduct if and only if it is true in almost all components.) 


Proof 


(a) We shall use induction on the number m of connectives and quantifiers in 
3. We can reduce the case m = 0 to the following subcases - .! (ft 
A n k (x h ,. . . ,x,-J; (ii) xt = /* fa, , . . . ,**„); and (iii) x e = a k . For subcase (i), ^ 
satisfies A^{x it , . . .,x im ) if and only if |= M A n k [{g h )^, . . . , (gi„)^], which is 
equivalent to {/| \= Mj A\ [g h (j),---, #/„ 0')] } G 3\ that is {y| sy satisfies 
A n k (xy, , . . . , x in ) in My} G 3 . Subcases (ii) and (iii) are handled in similar 
fashion. 


Now, let us assume the result holds for all wfs that have fewer than m 
connectives and quantifiers. 

Case 1.3 is By inductive hypothesis, 5 satisfies # in M if and only if 
(yjsy satisfies ^ in My} C 3 . s satisfies in M if and only if 

{y|sy satisfies ( £ in My} ^ 3 . But, since 3 is an ultrafilter, the last condition 
is equivalent, by exercise 2.119, to {y'jsy satisfies -fS in My} G 3 . 

Case 2. 3 is ^ A 3. By inductive hypothesis, s satisfies ^ in M if and only 
if {yjsy satisfies <£ in My} G 3, and 5 satisfies 3 in M if and only if 
{y|sy satisfies Q) in My} G 3 . Therefore, 5 satisfies ( £ A 3 if and only if both 
of the indicated sets belong to 3 . But, this is equivalent to their intersection 
belonging to 3 , which, in turn, is equivalent to {y‘|,sy satisfies # A 3 
in My} G 3 . 

Case 3. 3 is Assume 5 satisfies (Et,)^. Then there exists h in 

TljtzjDj such that s' satisfies # in M, where s' is the same as s’ except that hjr 
is the ith component of s' . By inductive hypothesis, s' satisfies $ in M if and 
only if {y|s^- satisfies in My} G 3. Hence, {y|sy satisfies (E\y)^ in My} 
G 3, since, if s'- satisfies ^ in My then sy satisfies (Ex,)# in My. 

Conversely, assume W = {y|sy satisfies(Ex,-)# in My} G 3 . For each j in 
W, choose some s' such that s'. is the same as sy except in at most the 7th 

J J t 

component and s'- satisfies #. Now define h in Ylj^jDj as follows: for j in W, 
let h(j) be the ith component of s'-, and , for j£ W, choose h (j) to be an 


^A wf , /„) can be replaced by (V»i) . . . (Vw„)(«i — t\ A ... A 

u„ = t„ A”{u 1 and a wf x = //’ (fi , . . . , t„) can be replaced by 

(Vzi) . . . (Vz„)(zi = t\ A ... A z n = t„ => x — f k (z 1, . . . , z,,)). In this way, every wf is 
equivalent to a wf built up from wfs of the forms (i)— (iii) by applying connectives and 
quantifiers. 
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arbitrary element of Dj. Let s" be the same as s except that its /th component 
is h&. Then W C {j\s" satisfies # in My} G Ju Hence, by the inductive 
hypothesis, s" satisfies in M. Therefore, s satisfies (3 xdfS in M. 

(b) This follows from part (a) by noting that a sentence 3ft is true in a 
model if and only if some sequence satisfies 3ft. 


COROLLARY 2.43 

If M is a model and & is an ultrafilter on J, and if M* is the ultrapower 
M then M* = M. 


Proof 


Let 38 be any sentence. Then, by Proposition 2.42(b), 3ft is true in M* if and 
only if {j\3ft is true in M } G 3F . If 3ft is true in M, {j\3ft is true in M ) 
— J C_ 3F . If 3ft is false in M, { j\3ft is true in M } = 0 ^ 3P . 

Corollary 2.43 can be strengthened considerably. For each c in the domain 
D of M, let c* stand for the constant function such that c*{j) = c for all j in 
J . Define the function i p such that, for each c in D, i//(c) = (c#)^ G D 1 [33 , 
and denote the range of i}/ by obviously contains the interpretations 

in M* of the individual constants. Moreover, M^ is closed under the op- 
erations ; for (f£) M {(cf)gr , . . . , (cf)^r) is h&, where h(j) = (fk) M 

(c) , . . . , c„) for all j in J , and \ , . . . , c„) is a fixed element b of D. So, 

h $ - — Thus, M # is a substructure of M*. 


COROLLARY 2.44 


i jt is an isomorphism of M with M^, and M# ^ e M 


Proof 

(a) By definition of M^, the range of i// is M # . 

(b) \j/ is one-one. (For any c, d in D, ( c = (d #)# r if and only if 

which is equivalent to (y|c#(/) = d#(j)} € iF; that is, 
{j\c — d] € 2ft . If J, {j\c — d} — 0 £ and, therefore, Me) ^ \j/( d ). 

(c) For any ci , ... , c„ in D, (J%) u * (^(c, ) . . . , }J/{c»)) = (4") ((4 )jr, • ••, 

wh f e h U) = (fp u (4 o), ■ • v 40)) = ■ ■-,£„)■ 

Thus, % - (W) M (ci, . • • , c n )f/S W = <Jr((#) M (ci, . . . , e„)). 

(d) |= M * Al[\j/{ci )..., i j/{c„)] if and only if {j\ |= M Al(i}/(ci)(j), •••, 

)(/'))} G which is equivalent to {/j (= 1 ^ , . . . ,£„)} G JF, that is, 

|=m . . . , c„]. Thus, i p is an isomorphism of M with M # . 


4 * • 
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To see that let SB be any wf and (cf )^, . . . , (cf )# gM # , 

Then, by proposition 2.42(a), (=m* ^[(cf )&■> • • • (cf )jf] if and only if 
01 t=M #[cfOV- v *?(/)]} e J 5 ', which is equivalent to {/( |=m &[ci,... } 
c„]} G S?, which, in turn, is equivalent to SB[c\, . . . , c„], that is, to 
1 =m^ $[( c t O, . . . , (cf Oj, since i// is an isomorphism of M with M#. 

Exercises 

2.124 (The compactness theorem again; see Exercise 2.54) If all finite sub- 
sets of a set of sentences T have a model, then T has a model. 

2.125 

(a) A class if of interpretations of a language ££ is called elementary if 
there is a set T of sentences of S£ such that if is the class of all models 
of T. Prove that if is elementary if and only if if is closed under 
elementary equivalence and the formation of ultraproducts. 

(b) A class if of interpretations of a language «£? will be called sentential if 
there is a sentence SB o f SB such that if is the class of all models of SB. 
Prove that a class if is sentential if and only if both if and its com- 
plement if (all interpretations of not in if) are closed with respect 
to elementary equivalence and ultraproducts. 

(c) Prove that theory K of fields of characteristic 0 (see page 117) is 
axiomatizable but not finitely axiomatizable. 

Non-standard analysis 

■ 

From the invention of the calculus until relatively recent times the idea of 
infinitesimals has been an intuitively meaningful tool for finding new results 
in analysis. The fact that there was no rigourous foundation for in- 
finitesimals was a source of embarrassment and led mathematicians to 
discard them in favour of the rigorous limit ideas of Cauchy and Weier- 
strass. However, about forty years ago, Abraham Robinson discovered that 
it was possible to resurrect infinitesimals in an entirely legitimate and precise 
way. This can be done by constructing models that are elementarily 
equivalent to, but not isomorphic to, the ordered field of real numbers. Such 
models can be produced either by using Proposition 2.33 or as ultrapowers. 
We shall sketch here the method based on ultrapowers. 

Let R be the set of real numbers. Let K be a generalized predicate calculus 
with equality having the following symbols: 

1. For each real number r, there is an individual constant a, . 

2. For every n-ary operation (p on /?, there is a function letter /^. 

3. For every n-ary relation <D on R, there is a predicate letter A®. 

We can think of R as forming the domain of a model i M for K; we simply let 
( a r f = r, ( f v f = if, and (Acf = 
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Let 3F be a non-principal ultrafilter on the set co of natural numbers. We 
can then form the ultrapower = ^S KU /#\ We denote the domain R m /#" of 
by 7?*. By Corollary 2.43, and, therefore, .4?* has all the prop- 

erties formahzable in K that M possesses. Moreover, by Corollary 2.44, 3ft 
has an elementary submodel that is an isomorphic image of 3ft. The 
domain R# of consists of all elements (c#)^ corresponding to the 
constant functions c*(i) = c for all i in co. We shall sometimes refer to the 
members of R# also as real numbers; the elements of R* — R^ will be called 
non-standard reals. 

That there exist non-standard reals can be shown by explicitly exhibiting 
one. Let i(j) = j for all j in w. Then ijs- E R*. However, (c#)^ < ijr for all c 
in R , by virtue of Los’s theorem and the fact that 
{i\c*U) < '0')} = {j\c < y}, being the set of all natural numbers greater 
than a fixed real number, is the complement of a finite set and is, therefore, 
in the non-principal ultrafilter is an ‘infinitely large’ non-standard 

real. (The relation < used in the assertion (c#) ^ < i# is the relation on the 
ultrapower 01* corresponding to the predicate letter < of K. We use the 
symbol < instead of (<) in order to avoid excessive notation, and we shall 
often do the same with other relations and functions, such as u + 1 >, a x v, 
and |n|.) 

Since 0t possesses all the properties of .4? formahzable in K, is an 
ordered field having the real number field 3fft- as a proper subfield. (3ft is 
non-Archimedean: the element defined above is greater than all the 
natural numbers («#) ^ of 3#* .) Let R\, the set of ‘finite’ elements of R*, 
contain those elements z such that \z\ < u for some real number u in R&. (R\ 
is easily seen to form a subring of R *.) Let Rq consist of 0 and the ‘in- 
finitesimals’ of R*, that is, those elements 0 such that \z\ < a for all 
positive real numbers u in R&. The reciprocal 1 /ijr is an infinitesimal.) It is 
not difficult to verify that Rq is an ideal in the ring R\. In fact, since 
x € R\ — Rq implies that l/x E R\ — Ro, it can be easily proved that Rq is a 
maximal ideal in R \ . 


Exercises 

2.126 Prove that the cardinality of R* is 2^°. 

2.127 Prove that the set Rq is closed under the operations of T, — and x. 

2.128 Prove that, if x E R\ and y E Ro, then xy E Rq. 

2.129 Prove that, if x E R\ — Ro, then l/x E Ri — Rq. 

Let x E R\ . Let A = {u\u e R# Ru < x} and B — {i/|« e R& A u > x}. Then 
(A,B) is a ‘cut’ and, therefore, determines a unique real number r such that 
(1) (Vx)(x EA=> x^r) and (2) (Vx)(x E 5 => x^r).^ The difference x — r is 0 

'See Mendelson (1973, chap. 5). 
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or an infinitesimal. (Proof: Assume x — r is not 0 or an infinitesimal. Then 
|x — r\ > r [ for some positive real number r\ . If x > r, then x — r> r \ . So 
x> r + r\ > r. But then r + n EA, contradicting condition (1). If x < r 
then r — x > r \ , and so r > r — r\ > x. Thus, r — r\ E B, contradicting con- 
dition (2).) The real number r such that x — r is 0 or an infinitesimal is called 
the standard part of x and is denoted st(x). Note that, if x is itself a real 
number, then st(x) = x. We shall use the notation x w y to mean 
st(x) = st(y). Clearly, x&y if and only if x — y is 0 or an infinitesimal. If 
x « y, we say that x and y are infinitely close. 


Exercises 

2.130 If x G Ri, show that there is a unique real number r such that x — r is 
0 or an infinitesimal. (It is necessary to check this to ensure that st(x) is well- 
defined.) 

2.131 If x and y are in R\, prove the following. 

(a) st(x + j>) = st(x) 4- st(v) 

(b) st(xj) = st(x)st(y) 

(c) st(— x) — — st(x) A st(y — x) — st(;r) — st(x) 

(d) x ^ 0 st(x) ^ 0 

(e) x s t (x) < st (y) 

The set of natural numbers is a subset of the real numbers. Therefore, in 
the theory K there is a predicate letter N corresponding to the property 
x G co. Hence, in R*, there is a set co* of elements satisfying the wf N(x). An 
element of R* satisfies N(x) if and only if {j\f(j) G co} G ZF. In particular, 
the elements «|r, for n G co, are the ‘standard’ members of of, whereas i 
for example, is a ‘non-standard’ natural number in R*. 

Many of the properties of the real number system can be studied from the 
viewpoint of non-standard analysis. For example, if s is an ordinary denu- 
merable sequence of real numbers and c is a real number, one ordinarily says 
that lim s„ = c if 

(&) (Ve)(e > 0 =>- (3«)(w G co A (Vk)(k G co A k^n => — c\ < e ))) 

Since s G R aj , s is a relation and, therefore, the theory K contains a predicate 
letter S(n,x) corresponding to the relation s„ = x. Hence, R* will have a 
relation of all pairs (n,x) satisfying S(n,x). Since {ft = this relation will 
be a function that is an extension of the given sequence to the larger domain 
oo*. Then we have the following result. 




PROPOSITION 2.45 


Let s be a denumerable sequence of real numbers and c a real number. Let 
denote the function from co* into R* corresponding to s in 0*. Then 
jj j-n S)l — c if and only if s*(n) ss c for all n in co* — co. (The latter condition 
can be paraphrased by saying that 5*(n) is infinitely close to c when n is 
infinitely large.) 


proof 

Assume lims,, = c. Consider any positive real e. By (&), there is a natural 
number no such that (Vk)(k £ co A /c^«o =4> — c| < e) holds in 0. Hence, 

the corresponding sentence (\/k){k E co* A k^ no ^ |s* (k) — c\ < e) holds in 
01* . For any n in co* — co, n > no and, therefore, \s*(n) — c\ < e. Since this 
holds for all positive real e, s*(n) — c is 0 or an infinitesimal. 

Conversely, assume s* (n ) « c for all n e co* — co. Take any positive real e. 
Fix some n\ in co* — co. Then (\fk){k^ni |s*(k) — c| < e). So the sentence 
(3«)(« £ co A (Vk)(k £ co Ak^n => \sk — c\ < a)) is true for 0* and, there- 
fore, also for 0. So there must be a natural number /?o such that 
Nk){k £ co A no =>• \sk — c\ < g). Since e was an arbitrary positive real 
number, we have proved lim s„ = c. 


Exercise 

2.132 Using Proposition 2.45, prove the following limit theorems for the 
real number system. If s and u are denumerable sequences of real num- 
bers and c\ and C2 are real numbers such that lim s„ = c\ and lim u n = c 2 , 
then: 

(a) lim^,, 4- u„) = ci + c 2 \ 

(b) lim(j B M„) = cyc 2 ; 

(c) If c 2 7^ 0 and all u„ ^ 0, lim (s„ / u ,, ) = c\fc 2 . 

Let us now consider another important notion of analysis, continuity. Let 
B be a set of real numbers, let c £ B, and let / be a function defined on B and 
taking real values. One says that f is continuous at c if 

(1) (Ve)(e > 0 -> (3<5)(<5 > 0 A (Vx ) (x £ B A \x — c| < <5 -> |f(x) — f(c)| < e))) 
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PROPOSITION 2.46 

Let f be a real- valued function on a set B of real numbers. Let c G B. Let B* 
be the subset of R* corresponding to B, and let /* be the function corre- 
sponding to f) Then / is continuous at c if and only if (Vx) (x g B* 
A x«c =>f*(x) «/(<?)). 


Exercises 

2.133 Prove Proposition 2.46. 

2.134 Assume / and g are real-valued functions defined on a set B of real 
numbers and assume that / and g are continuous at a point c in B. Using 
Proposition 2.46, prove the following. 

(a) f + g is continuous at c. 

(b) / • g is continuous at c. 

2.135 Let f be a real- valued function defined on a set B of real numbers 
and continuous at a point c in B, and let g be a real-valued function defined 
on a set A of real numbers containing the image of B under /. Assume that g 
is continuous at the point f(c). Prove, by Proposition 2.46, that the com- 
position go f is continuous at c. 

2.136 Let CCR. 

(a) C is said to be closed if (Vx)((Ve)[£ > 0 (3y)(j’ G C A |x — y| < 

e)] a.yG C). Show that C is closed if and only if every real number that 
is infinitely close to a member of C* is in C. (b) C is said to be open if 
(Vx) (x G C (3<5)(<5 > 0 A (Vy)(|y — xj < <5 =£> y G C))). Show that C is 
open if and only if every non-standard real number that is infinitely 
close to a member of C is a member of C*. 

Many standard theorems of analysis turn out to have much simpler 
proofs within non-standard analysis. Even stronger results can be obtained 
by starting with a theory K that has symbols, not only for the elements, 
operations and relations on R, but also for sets of subsets of R, sets of sets of 
subsets of R, and so on. In this way, the methods of non-standard analysis 
can be applied to all areas of modern analysis, sometimes with original and 
striking results. For further development and applications, see A. Robinson 
(1966), Luxemburg (1969), Bernstein (1973), Stroyan and Luxemburg 
(1976), and Davis (1977a). A calculus textbook based on non-standard 
analysis has been written by Keisler (1976) and has been used in some 
experimental undergraduate courses. 


Uo be more precise, / is represented in the theory K by a predicate letter A/-, 
where Af(x,y) corresponds to the relation f(x) — y. Then the corresponding relation 
Ay in R* determines a function f* with domain B*. 
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Exercises 

2.137 A real-valued function / defined on a closed interval [a,b] 
-s is said to be uniformly continuous if 


(Ve)(e > 0 => (3<5)((5 > 0 A (Vx) (\/y) (a < x ^ b A a^y^b A |x —y\ <8 

=> l/W -fly ) I < e))) 

prove that f is uniformly continuous if and only if, for all x and y in [a, b]*, 

2.138 Prove by non-standard methods that any function continuous on 
f a ,b\ is uniformly continuous on [a, b]. 


2.15 SEMANTIC TREES 


Remember that a wf is logically valid if and only if it is true for all inter- 
pretations. Since there are uncountably many interpretations, there is no 
simple direct way to determine logical validity. Godel’s completeness the- 
orem (Corollary 2.19) showed that logical validity is equivalent to deriva- 
bility in a predicate calculus. But, to find out whether a wf is provable in a 
predicate calculus, we have only a very clumsy method: start generating the 
theorems and watch to see whether the given wf ever appears. Our aim here 
is to outline a more intuitive and usable approach in the case of wfs without 
function letters. Throughout this section, we assume that no function letters 
occur in our wfs. 

A wf is logically valid if and only if its negation is not satisfiable. We shall 
now explain a simple procedure for trying to determine satisfiability of a 
closed wf SB.^ Our purpose is either to show that SB is not satisfiable or to 
find a model for SB. 

We shall construct a figure in the shape of an inverted tree. Start with the 
wf^at the top (the ‘root’ of the tree). We apply certain rules for writing wfs 
below those already obtained. These rules replace complicated wfs by sim- 
pler ones in a way that corresponds to the meaning of the connectives and 
quantifiers. 


^Remember that a wf is logically valid if and only if its closure is logically valid. 
So it suffices to consider only closed wfs. 
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Universal quantifier: 

(Vx)X’(x) 

(Rule U) 


1 

[Here, b is any individual 


9(b) 

constant already present.] 

Existential quantifier: 

(3x)V(x) 



i 

[c is a new individual 


9(c) 

constant not already in 
the figure.] 


Note that some of the rules require a fork or branching. This occurs when 
the given wf implies that one of two possible situations holds. 

A branch is a sequence of wfs starting at the top and proceeding down the 
figure by applications of the rules. When a wf and its negation appear in a 
branch, that branch becomes closed and no further rules need be applied to 
the wf at the end of the branch. Closure of a branch will be indicated by a 
large cross X . 

Inspection of the rules shows that, when a rule is applied to a wf, the 
usefulness of that wf has been exhausted (the formula will be said to be 
discharged) and that formula need never be subject to a rule again, except in 
the case of a universally quantified wf. In the latter case, whenever a new 
individual constant appears in a branch below the wf, rule U can be applied 
with that new constant. In addition, if no further rule applications are possible 
along a branch and no individual constant occurs in that branch, then we must 
introduce a new individual constant for use in possible applications of rule U 
along that branch. (The idea behind this requirement is that, if we are trying 
to build a model, we must introduce a symbol for at least one object that can 
belong to the domain of the model.) 
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0ASIC PRINCIPLE OF SEMANTIC TREES 

jf all branches become closed, the original wf is unsatisfiable. If, however, a 
branch remains unclosed, that branch can be used to construct a model in 
which the original wf is true; the domain of the model consists of the in- 
dividual constants that appear in that branch. 

We shall discuss the justification of this principle later on. First, we shall 
give examples of its use. 


Examples 

1. To prove that (Vx)^(x) ^(b) is logically valid, we build a semantic tree 

starting from its negation. 


(i) -.((Vx)^(x) <E{b)) 

(ii) (Vx)^'(x) 

(i) 

(iii) ~M{b) 

(i) 

(iv) <&{b) 

X 

(ii) 

The number to the right of a given wf indicates the number of the line of the 
wf from which the given wf is derived. Since the only branch in this tree is 
closed, -i((Vx)^(x) =r^(fc)) is unsatisfiable and, therefore, (Vx)#(x) => ^(fc)) 

is logically valid. 

2. (i) ->[(Vx)(#(x) @(x)) => ((Vx)#(x) 

=> (Vx)^(x))] 

(ii) (Vx)(^(x) => 3}(x)) 

(i) 

(iii) -i((Vx)^(x) (Vx)^(x)) 

(i) 

(iv) (Vx)^(x) 

(iii) 

(v) -i(Vx).S?(a-) 

(iii) 

(vi) (3x)-i^(x) 

(v) 

(vii) -i Qj{b) 

(vi) 

(viii) b ) 

(iv) 

(ix) ^( b ) 3>{b ) 

(ii) 

X \ 


(x) ~^{b) @(b) 

(ix) 

X X 


Since both branches are closed, the original wf (i) is unsatisfiable and, 

therefore, (Vx)(#(x) £^(x)) => ((Vx)^(x) 

3. (i) “’[(3x)yl}(x) => (Vx)y4|(x)] 

=> (Vx)^(x)) is logically valid. 

(ii) (3x)v4](x) 

(0 

(iii) -i(Vx)/l j (x) 

(i) 

(iv) A\{b) 

(ii) 

(v) (3x)-u4j (x) 

(iii) 

(vi) -A\{c) 

(v) 




No further applications of rules are possible and there is still an opejf 
branch. Define a model M with domain {b, c} such that the interpretation of 
A } holds for b but not for c. Thus, (3x)-u4{(x) is true in M but (Vx)/1 } (x) j s 
false in M. Hence, (3x)vl] (x) => (Vx)ytj (x) is false in M and is, therefore, not? 
logically valid. 


4. (i) -•[(3 p)(Vx)^(x,p) =s> (Vx)(3y)#(x,j>)] 


(ii) 


(i) 

(iii) 

-^(Vx)(3y)@(x,y) 

(i) 

(iv) 

(Vx)^(x, b) 

(ii) 

(v) 

(3x)-*(3 y)^{x,y) 

(iii) 

(vi) 

®{b, b) 

(iv) 

(vii) 

^(3y)@(c,y) 

(v) 

(viii) $(c,b) 

(iv) 

(ix) 

(Vy)^(c,p) 

(vii) 

(x) 

— »^(c, b ) 

(ix) 


X 

Hence, (3y)(Vx)^?(x,_y) (yx)(3y)&(x,y) is logically valid. 

Notice that, in the last tree, step (vi) served no purpose but was required 
by our method of constructing trees. We should be a little more precise in 
describing that method. At each step, we apply the appropriate rule to each 
undischarged wf (except universally quantified wfs), starting from the top of 
the tree. Then, to every universally quantified wf on a given branch we apply 
rule U with every individual constant that has appeared on that branch since 
the last step. In every application of a rule to a given wf, we write the 
resulting wf(s) below the branch that contains that wf. 

5. (i) ->[(Vx)^(x) => (3x)^(x)] 


(ii) (Vx)^(x) (i) 

(iii) ->(3x)^(x) (i) 

(iv) (Vx)^(x) (iii) 

(v) ®(b) (ii)* 

(vi) ^{b) (iv) 

X 

Hence, (Vx)^(x) => (3x)^(x) is logically valid. 

6. (i) -i[(Vx)-</4j(x,x) =4- (3x)(Vy)^4f(x,p)] 

(ii) (Vx)-i<4|(x,x) (i) 

(iii) -.(ax)(Vy)^(x,p) (ii) 

(iv) (Vx)^(Vy)^4?(x,p) (iii) 

(v) ->Aj a t ) (ii)* 

(vi) (a h y) (iv) 


t Here, we must introduce a new individual constant for use with rule U since, 
otherwise, the branch would end and would not contain any individual constants. 
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(vii) {3y)^A]{a u y) 

(vi) 

(viii) ->->A\{a\ , aj) 

(vii) 

(ix) A\{a\,a 2 ) 

(viii) 

(x) ->Aj{a 2 ,a 2 ) 

(ii) 

(xi) ~ i (^y)~'A}{a 2 ,y) 

(iv) 

(xii) (3j’) ->->A] {a 2 , j’) 

(xi) 

(xiii) -i-A* ( 02 , 03 ) 

(xii) 

(xiv) A]{ci 2 ,ai) 

(xiii) 


AVe can see that the branch will never end and that we will obtain a sequence 
of constants ay , a 2 , . . . with wfs A^{a„, a„ + y) and -A*(a„, a„). Thus, we con- 
struct a model M with domain and we define (^^) M to contain 

only the pairs (a„,a n+ \). Then, (Vx)-u4j(x,x) is true in M, whereas 
(Tt) (Vp) ->A\ (x, p) is false in M. Hence, (Vx) ->A\ (x, x) => (3x)(Vp)-u4f(x, j;) is 
not logically valid. 

Exercises 

2.139 Use semantic trees to determine whether the following wfs are logi- 
cally valid. 

(a) (Vx ) (A j (x) VA\(x)) => ( (Vx)// J (x) ) V (Vx)A ] 2 (x) 

(b) ((Vx).^(x)) A (Vx)^(x) (Vx) {88{x) A 'A(x)). 

(c) {Vx){88{x) A tf(x)) : => ( (Vx) 88 (x) ) A (Vx)^(x) 

(d) (3x) (A j (x) => A\ (x) ) =^> ( {3x)A \ (x) => ( 3x)A\ (x) ) 

(e) ( 3x) ( 3 \y)A\ (x, y) => ( d z)A] (z, z) 

(f) ((Vx)/J. J (x)) V (Vx)A\(x) => (Vx)(/tj(x) V A 2 (x) ) 

(g) (3x)(3>')(^?(x,j') => {Vz)A\{z,y)) 

(h) The wfs of Exercises 2.24, 2.31 (a, e, j), 2.39 and 2.40. 

(i) The wfs of Exercise 2.21 (a, b, g). 


PROPOSITION 2.47 

Assume that T is a set of closed wfs that satisfy the following closure con- 
ditions: (a) if -1-1,08 is in T, then 8ft is in T; (b) if —>{88 V ^) is in T, then —>88 
and -f& are in T; (c) if ->{88 ^) is in T, then 88 and —f4 are in T; (d) if 

->(Vx)^ is in T, then (Ebc)-i^ is in T; (e) if ->{3x)88 is in T, then {\fx)->88 is in 
T; (f) if —>{88 A ^) is in F, then at least one of —>88 and is in T; (g) if 
->{88 *&) is in F, then either 8ft and —f£ are in F, or —>88 and ^ are in F; (h) 

if 88 A ^ is in F, then so are 88 and (i) if 88 V # is in T, then at least one of 
88 and *8 is in T, (j) if 88 =^> <8 is in T, then at least one of —>8ft and <8 is in T; 
(k) if 8ft ^ is in T, then either 88 and are in T or —>88 and —f€ are in T ; (1) 
if Vx)^(x) is in F, then 88(b) is in F (where b is any individual constant that 
occurs in some wf of T); (m) if (3x)^(x) is in T, then 88{b) is in F for some 
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individual constant b. If no wf and its negation both belong to T and sortie 
wfs in r contain individual constants, then there is a model for F whose 
domain is the set D of individual constants that occur in wfs of T. 


Proof 

Define a model M with domain D by specifying that the interpretation of any 
predicate letter A' l k in F contains an /7-tuple (b i,..., b„) if and only if 
A k {by , . . . , b„) is in F. By induction on the number of connectives and 
quantifiers in any closed wf £, it is easy to prove: (i) if £ is in T, then £ is trui 
in M; and (ii) if ->£, is in T, then £ is false in M. Hence, M is a model for T, 
If a branch of a semantic tree remains open, the set F of wfs of that 
branch satisfies the hypotheses of Proposition 2.47. If follows that, if a 
branch of a semantic tree remains open, then the set T of wfs of that branch 
has a model M whose domain is the set of individual constants that appear 
in that branch. This yields half of the basic principle of semantic trees. 


PROPOSITION 2.48 

If all the branches of a semantic tree are closed, then the wf 8ft at the root of 
the tree is unsatisfiable. 


Proof 

From the derivation rules it is clear that, if a sequence of wfs starts at 8ft and 
continues down the tree through the applications of the rules, and if the wfs 
in that sequence are simultaneously satisfiable in some model M, then that 
sequence can be extended by another application of a rule so that the added 
wf(s) would also be true in M. Otherwise, the sequence would form an 
unclosed branch, contrary to our hypothesis. Assume now that 8ft is sa- 
tisfiable in a model M. Then, starting with 8ft, we could construct an infinite 
branch in which all the wfs are true in M. (In the case of a branching rule, if 
there are two ways to extend the sequence, we choose the left-hand wf.) 
Therefore, the branch would not be closed, contrary to our hypothesis. 
Hence, 8ft is unsatisfiable. 

This completes the proof of the basic principle of semantic trees. Notice 
that this principle does not yield a decision procedure for logical validity. If 
a closed wf 8ft is not logically valid, the semantic tree of ->8ft may (and often 
does) contain an infinite unclosed branch. At any stage of the construction 
of this tree, we have no general procedure for deciding whether or not, at 
some later stage, all branches of the tree will have become closed. Thus, we 
have no general way of knowing whether 8ft is unsatisfiable. 
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for the sake of brevity, our exposition has been loose and imprecise. A 
clear and masterful study of semantic trees and related matters can be found 
jii Smullyan (1968). 
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2.16 QUANTIFICATION THEORY ALLOWING EMPTY DOMAINS 

0 lir definition in Section 2.2 of interpretations of a language assumed that 
the domain of an interpretation is non-empty. This Was done for the sake of 
simplicity. If we allow the empty domain, questions arise as to the right way 
of defining the truth of a formula in such a domain. t Once that is decided, 
the corresponding class of valid formulas (that is, formulas true in all in- 
terpretations, including the one with an empty domain) becomes smaller, 
and it is difficult to find an axiom system that will have all such formulas as 
its theorems. Finally, an interpretation with an empty domain has little or 
no importance in applications of logic. 

Nevertheless, the problem of finding a suitable treatment of such a more 
inclusive logic has aroused some curiosity and we shall present one possible 
approach. In order to do so, we shall have to restrict the scope of the 
investigation in the following ways. 

First, our languages will contain no individual constants or function 
letters. The reason for this restriction is that it is not clear how to interpret 
individual constants or function letters when the domain of the interpreta- 
tion is empty. Moreover, in first-order theories with equality, individual 
constants and function letters always can be replaced by new predicate 
letters, together with suitable axioms.! 

Second, we shall take every formula of the form (Vx)£$(x) to be true in the 
empty domain. This is based on parallelism with the case of a non-empty 
domain. To say that (Vx)^(jc) holds in a non-empty domain D amounts to 
asserting 

(*) for any object c, if c 6 D, then 

When D is empty, ‘c € D’ is false and, therefore, ‘if c € D, then ^(c)’ is true. 
Since this holds for arbitrary c, (*) is true in the empty domain D, that is, 
(Vjc)^(x) is true in an empty domain. Not unexpectedly, (3x)&(x) will be 
false in an empty domain, since (3x)&(x) is equivalent to -.(Vx)-^(jt). 

These two conventions enable us to calculate the truth value of any closed 
formula in an empty domain. Every such formula is a truth-functional 
combination of formulas of the form (Vx)^(jc). Replace every subformula 

iFor example, should a formula of the form (V.v) (A [ (x) a -\A [ (a - )) be considered 
true in the empty domain? 

iFor example, an individual constant b can be replaced by a new monadic 
predicate letter P, together with the axiom (3 \y)(\/x)(P(x) o- x = y). Any axiom 3${b) 
should be replaced by (Vx)(P(x) => f%(x)). 
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(Va')^(x) by the truth value T and then compute the truth value of the whole 
formula. 

It is not clear how we should define the truth value in the empty domain 
of a formula containing free variables. We might imitate what we do in the 
case of non-empty domains and take such a formula to have the same truth 
values as its universal closure. Since the universal closure is automatically 
true in the empty domain, this would have the uncomfortable consequence 
of declaring the formula A\ (x) A ->A | (x) to be true in the empty domain. For 
this reason, we shall confine our attention to sentences, that is, formulas 
without free variables. 

A sentence will be said to be inclusively valid if it is true in all intefi 
pretations, including the interpretation with an empty domain. Every in* 
clusively valid sentence is logically valid, but the converse does not hold. To 
see this, let f stand for a sentence A ->#, where is some fixed sentence* 
Now, f is false in the empty domain but (Vx)f is true in the empty domain 
(since it begins with a universal quantifier). Thus the sentence (Vx)f => f j s 
false in the empty domain and, therefore, not inclusively valid. However, it 
is logically valid, since every formula of the form (Vx)® => ® is logically 
valid. 

The problem of determining the inclusive validity of a sentence is re- 
ducible to that of determining its logical validity, since we know how to 
determine whether a sentence is true in the empty domain. Since the problem 
of determining logical validity will turn out to be unsolvable (by Proposition 
3.54), the same applies to inclusive validity. 

Now let us turn to the problem of finding an axiom system whose the- 
orems are the inclusively valid sentences. We shall adapt for this purpose an 
axiom system PP # based on Exercise 2.28. As axioms We take all the fol- 
lowing formulas (see the Logical Axioms on p. 69): 

(Al) ® => (%> => 0) 

(A2) => (tf => 0)) ^ ((# => tf) => {® =* &)) 

(A3) (—i^ =$■ —*0) (( -1< ^ ? ®) => *<£?) 

(A4) (Wx)®(x) => ®{y) if ®(x) is a wf of Jzf and y is a variable that is free 

for x in ® (x ) . (Recall that, if y is x itself, then the axiom has the form 
( Vx )® =^> ®. In addition, x need not be free in ®(x).) 

(A5) (Vx)(® (Yx) r A) if ® contains no free occurrences of x. 

(A6) (Vy,) . . . (Vy„)(® => <f) [(Vy,) . . . (Vy„)^ => (Vj-i) . . . (Vj *)«] 

together with all formulas obtained by prefixing any sequence of universal 
quantifiers to instances of (A1)-(A6). 

Modus ponens (MP) will be the only rule of inference. 

PP denotes the pure first-order predicate calculus, whose axioms are 
(A1)-(A5), whose rules of inference are MP and Gen, and whose language 
contains no individual constants or function letters. By Godel’s complete- 
ness theorem (Corollary 2.19), the theorems of PP are the same as the 
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logically valid formulas in PP. Exercise 2.28 shows first that Gen is a derived 
rule of inference of PP#, that is, if h PP # 3ft, then |- PP # (Vx)^, and second that 
pp and PP# have the same theorems. Hence, the theorems of PP# are the 
logically valid formulas. 

Let PPS# be the same system as PP# except that, as axioms, we take only 
the axioms of PP# that are sentences. Since MP takes sentences into sen- 
tences, all theorems of PPS# are sentences. Since all axioms of PPS# are 
axioms of PP#, all theorems of PPS# are logically valid sentences. Let us 
show that the converse holds. 
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PROPOSITION 2.49 

Every logically valid sentence is a theorem of PPS#. 


Proof 

Let 3ft be any logically valid sentence. We know that 3ft is a theorem of PP# . 
Let us show that 3ft is a theorem of PPS#. In a proof of 3ft in PP#, let 
;<i, . . . , u„ be the free variables (if any) in the proof, and prefix (V«i) . . . (Vw„) 
to all steps of the proof. Then each step goes into a theorem of PPS#. To see 
this, first note that axioms of PP# go into axioms of PPS#. Second, assume 
that 3ft comes from # and 'Y => 3ft by MP in the original proof and that 
=(Vi/i) • • • (Vw,,)^ and (V/q) . . . (Vw„)(# =>• 3ft) are provable in PPS#. Since 
(Vwi) • • • (Vw„)(# => 3ft) => [(Vwi) . . . (Vm„)# => (Vwj ) . . . {\/u n )3ft ] is an in- 
stance of axiom (A6) of PPS#, it follows that (Vwi) . . . (\/u„)3ft is provable in 
PPS#. Thus, (Vwi) . . . (’ Vu„)3 d is a theorem of PPS#. Then n applications of 
axiom (A4) and MP show that 3ft is a theorem of PPS# . 

Not all axioms of PPS# are inclusively valid. For example, the sentence 
(Vx)f => f discussed earlier is an instance of axiom (A4) that is not in- 
clusively valid. So, in order to find an axiom system for inclusive validity, we 
must modify PPS#. 

If P is a sequence of variables u \ , . . . , u,„ then by VP we shall mean the 
expression (Viq) . . . (Vw„). 

Let the axiom system ETH be obtained from PPS# by changing axiom 
(A4) into: 

(A4 r ) All sentences of the form VP[(Vx)3ft(x) 3ft (y)\, where y is free for x 

in 3ft (x) and x is free in 3ft (x), and P is a sequence of variables that 
includes all variables free in 3ft (and possibly others). 

MP is the only rule of inference. 

It is obvious that all axioms of ETH are inclusively valid. 




QUANTIFICATION THEORY 


150 


LEMMA 2.50 


If ST is an instance of a tautology and P is a sequence of variables that 
contains all free variables in ST , then F E th VP^. 


Proof 

By the completeness of axioms (A1)-(A3) for the propositional calculus 
there is a proof of ST using MP and instances of (A1)-(A3). If we prefix VP 
to all steps of that proof, the resulting sentences are all theorems of ETH. In? 
the case when an original step SB was an instance of (A1)-(A3), MPSB is an 
axiom of ETH. For steps that result from MP, we use axiom (A6). 


LEMMA 2.51 

If P is a sequence of variables that includes all free variables of SB ( S, and 
Feth yP£B and Feth \!P\SB < P >\ , then I~eth YP^F 

Proof 

Use axiom (A6) and MP. 

LEMMA 2.52 

If P is a sequence of variables that includes all free variables of SB, C S, SB, and 
Feth VP[# => S>\ and Feti ] VP [P => fBj, then FethYP[^ SS \ . 

Proof 

Use the tautology {SB => =^> {{<$ =^> SB) {SB => SB)), Lemma 2.50, and 

Lemma 2.51 twice. 


LEMMA 2.53 

If x is not free in SB and P is a sequence of variables that contains all free 
variables of SB, Feth VP[SB {Vx)SB]. 

Proof 

By axiom (A5), F et h YP[(Yr)(B => B) => (B (Yx)B)]. By Lemma 2.50, 
Feth VP[(Vx ) {SB SB)]. Now use Lemma 2.51. 
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COROLLARY 2.54 

If 0 has no free variables, then Feth => (Vx)0?. 


LEMMA 2.55 

If x is not free in 38 and P is a sequence of variables that includes all 
variables free in 38, then Feth VP[— '(Vx)f => ({\jx)38 => 01)] . 

Proof 

Feth MP\p38 => (38 => f)] by Lemma 2.50. By Lemma 2.53, Feth VP 
1(0/ => f) (Vx) (38 => f)]. Hence, by Lemma 2.52, Feth VP[—<& => (Vx) 
(0 => f)] . By axiom (A6), F E th VP[(\/x)(38 =>!)=> ((Vx)^ (Vx)f)]. 

Hence, by Lemma 2.52, Feth VP[->,# => ((\fx)38 => (Vx)f)]. Since [-^ => 
((Vx)^? => (Vx)f)] => [-i(Vx)f => ((Vx)38 => ,#)] is an instance of a tautology. 
Lemmas 2.50 and 2.51 yield Feth VP[-i(Vx)f =» ((\/x)38 => 38 )]. 


PROPOSITION 2.56 

ETH + (-i(Vx)f} is a complete axiom system for logical validity, that is, a 
sentence is logically valid if and only if it is a theorem of the system. 

Proof 

All axioms of the system are logically valid. (Note that (Vx)f is false in all 
interpretations with a non-empty domain and, therefore, -t(Vx)f is true in all 
such domains.) By Proposition 2.49, all logically valid sentences are pro- 
vable in PPS 7 ^. The only axioms of PPS^ missing from ETH are those of the 
form VP[(Vx),# => 38], where x is not free in 38 and P is any sequence of 
variables that include all free variables of 38. By Lemma 2.55, 
Fern VP[->(Yx)f => ((\/x)38 =t> 38)]. By Corollary 2.54, VP[-i(Vx)f] will be 
derivable in ETH + {-i(Vx)f}. Hence, VP[(Vx).$$ => 38] is obtained by using 
axiom (A6). 


LEMMA 2.57 


If P is a sequence of variables that include all free variables of 38, 
Feth VP[(Vx)f => ((Vx)38 <=> t)], where t is -if. 
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Proof 

Since f => 88 is an instance of a tautology. Lemma 2.50 yields 
f ETH VP(Vx:)[f => 88). By axiom (A6), t- ETH VP[(Vjc) [f => 88) [(Vx)f 
=*> (V 'x)0fl. Hence, F ETH VP[(Vx;)f =$■ (Vx)0\ by Lemma 2.51. Since 
Nx)88 => [(Vx)88 <£> t] is an instance of a tautology. Lemma 2.50 yields 
F E th VP[(Vx) 3V => [{\Jx)88 t]]. Now, by Lemma 2.52, h ETH VP[(Vx)f 

=» [(Vjc)^ t]]. 

Given a formula 88, construct a formula 88* in the following way. Moving 
from left to right, replace each universal quantifier and its scope by t. 


LEMMA 2.58 

If P is a sequence of variables that include all free variables of 88, then 
h EX H VP[(Vx)f => [88 ^ &*]] • 

Proof 

Apply Lemma 2.57 successively to the formulas obtained in the stepwise 
construction of 88* . We leave the details to the reader. 


PROPOSITION 2.59 

ETH is a complete axiom system for inclusive validity, that is, a sentence 38 
is inclusively valid if and only if it is a theorem of ETH. 


Proof 

Assume 88 is a sentence valid for all interpretations. We must show that 
t- ETH $8- Since 88 is valid in all non-empty domains, Proposition 2.56 implies 
that 88 is provable in ETH + {-i(Vjc)f}. Hence, by the deduction theorem, 

(+) F eee "i(Vx)f =£■ 88 . 

Now, by Lemma 2.58, 

(%) F eto (Vx)f => [88 88*} 

(Since 88 has no free variables, we can take P in Lemma 2.58 to be empty.) 
Hence, [(Vx)f \88 <=> 88*)) is valid for all interpretations. Since (Vx)f is 
valid in the empty domain and 88 is valid for all interpretations, 88* is 
valid in the empty domain. But 88* is a truth-functional combination of ts. 
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go, 38* must be truth-functionally equivalent to either t or f. Since it is 
valid in the empty domain, it is truth-functionally equivalent to t. Hence, 
Leth 38*. Therefore by (%), F E th (Vx)f => 38. This, together with (+), 
yields Feth 38. 

The ideas and methods used in this section stem largely, but not entirely, 
from a paper by Hailperin (1953). * That paper also made use of an idea in 
Mostowski (1951b), the idea that underlies the proof of Proposition 2.59. 
Mostowski’s approach to the logic of the empty domain is quite different 
from Hailperin’s and results in a substantially different axiom system for 
inclusive validity. For example, when 38 does not contain x free, Mostowski 
interprets ( Yx ) 38 and (3 x)38 to be 38 itself. This makes (Vx)f equivalent to f, 
rather than to t, as in our development. 


*The name ETH comes from ‘empty domain’ and ‘Theodore Hailperin’. My 
simplification of Hailperin’s axiom system was suggested by a similar simplification 
in Quine (1954). 






3.1 AN AXIOM SYSTEM 

Together with geometry, the theory of numbers is the most immediately 
intuitive of all branches of mathematics. It is not surprising, then, that 
attempts to formalize mathematics and to establish a rigorous foundation 
for mathematics should begin with number theory. The first semi- axiomatic 
presentation of this subject was given by Dedekind in 1879 and, in a slightly 
modified form, has come to be known as Peano’s postulates.! It can be 
formulated as follows: 

(P1)0 is a natural number.! 

(P2) If x is a natural number, there is another natural number denoted by x' 
(and called the successor of x). 

(P3) 0 f x' for every natural number x. 

(P4) If x' = /, then x—y. 

(P5) If Q is a property that may or may not hold for any given natural 
number, and if (I) 0 has the property Q and (II) whenever a natural number 
x has the property Q, then x J has the property Q, then all natural numbers 
have the property Q (mathematical induction principle). 

These axioms, together with a certain amount of set theory, can be used 
to develop not only number theory but also the theory of rational, real and 
complex numbers (see Mendelson, 1973). However, the axioms involve 
certain intuitive notions, such as ‘property’, that prevent this system from 
being a rigorous formalization. We therefore shall build a first-order theory 
S that is based upon Peano’s postulates and seems to be adequate for the 
proofs of all the basic results of elementary number theory. 

The language S£ a of our theory S will be called the language of arithmetic. 
££ a has a single predicate letter Tf. As usual, we shall write t = s for Aj(t,s). 
£? a has one individual constant a \ . We shall use 0 as an alternative notation 

tFor historical information, see Wang (1957). 

!'fhe natural numbers are supposed to be the non-negative integers 0, 1,2, 

*The intuitive meaning of x' is x + 1 . 




AN AXIOM SYSTEM 


for ci\- Finally, A has three function letters, f\ , f\ and / 2 2 . We shall 
write (/) instead of j\ (t), (t + 5 ) instead of fi(t,s)> and (t • s) instead of 
However, we shall write t 1 , t + s, and t ■ s instead of ( t '), (t + s), and 
(/ • 5 ) whenever this will cause no confusion. 

The proper axioms of S are: 
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(SI) 

XI = X2 => ( X 1 

— x 3 => x 2 = 

--X 3 ) 

(S2) 

Xl — x 2 =>• x\ = 



(S3) 

0 f x\ 



(S4) 

x , [ — x' 2 =4> X\ = 

- x 2 


(S5) 

xi + 0 = xi 

A 


(S6) 

X] +X 2 = (X! + X 2 ) 


(S7) 

Xl • 0 = 0 

A 



(S8) 

X] • (* 2 ) = (*1 

•X 2 ) +Xi 


(S9) 

^(0) => ((Vx)(^(x) => &{x')) =>- (Vx)^(x)) for any wf &{x) of S 


We shall call (S9) the principle of mathematical induction. Notice that 
axioms (S1)-(S8) are particular wfs, whereas (S9) is an axiom schema pro- 
viding an infinite number of axioms. ^ 

Axioms (S3) and (S4) correspond to Peano postulates (P3) and (P4), 
respectively. Peano’s axioms (PI) and (P2) are taken care of by the presence 
of 0 as an individual constant and // as a function letter. Our axioms (SI) 
and (S2) furnish some needed properties of equality; they would have been 
assumed as intuitively obvious by Dedekind and Peano. Axioms (S5)-(S8) 
are the recursion equations for addition and multiplication. They were not 
assumed by Dedekind and Peano because the existence of operations + and 
• satisfying (S5)-(S8) is derivable by means of intuitive set theory, which was 
presupposed as a background theory (see Mendelson, 1973, chapter 2, 
Theorems 3.1 and 5.1). 

Any theory that has the same theorems as S is often referred to in the 
literature as Peano arithmetic, or simply PA. 

From (S9) by MP, we can obtain the induction rule : 

^(0), (Vx)(^(x) => $8{x')) h s (Vx)^(x). 

It will be our immediate aim to establish the usual rules of equality; that 
is, we shall show that the properties (A6) and (A7) of equality (see page 95) 
are derivable in S and, hence, that S is a first-order theory with equality. 

First, for convenience and brevity in carrying out proofs, we cite some 
immediate, trivial consequences of the axioms. 


^However, (S9) cannot fully correspond to Peano’s postulate (P5), since the 
latter refers intuitively to the 2^'° properties of natural numbers, whereas (S9) can 
take care of only the denumberable number of properties defined by wfs of £f A . 
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LEMMA 3.1 

For any terms t, s, r of & ? a, the following wfs are theorems of S. 

(SI') t — r => (t = s => r = 5 ) 

(S2') t = r=>t' = i J 
(S3') 0 £ t' 

(S4') t* = t-' =>t = r 

(S5') f + 0 = / 

(S6') t + ^ = (t + r)' 

(S7') t- 0 = 0 

(S8') t-r J = (t-r) + t 


Proof 


(S1')-(S8') follow from (S1)-(S8), respectively. First form the closure by 
means of Gen, use Exercise 2.48 to change all the bound variables to vari- 
ables not occuring in terms t, r, s, and then apply rule A4 with the appro- 
priate terms t , r, .vj 


PROPOSITION 3.2. 

For any terms f, s, r, the following wfs are theorems of S. 

(a) t = t 

(b) t — r =$■ r = t 

(c) t = r=$-(r = s=±t = s) 

(d) r ~ t => (s = t=> r = s) 

(e) t = r=>t-\-s = r + s 

(f) t = 0 + 1 

(g) ^ + r = (* + /*/ 

(h) t + r = 7* + t 

(i) t = 7 * =*► s + 1 = s + r 

(j) (t + r) 4- s = / 4- (r + 5 ) 

(k) t = r=^t-s — r-s 

(l) 0-f = 0 

(m) f • 7* = t • r 4- r 

tThe change of bound variables is necessary in some cases. For example, if we 
want to obtain X 2 = x\ => x' 2 = x'j from x\ =X 2 => x'j = x' 2 , we first obtain 
(Vxi)(Vx 2 )(xi —X 2 => x'j = x 2 ). We cannot apply rule A4 to drop (Vxi) and replace x\ 
by X 2 , since X 2 is not free for xi in (Vx 2 )(xi =X 2 => x , l = X 2 ). From now on, we shall 
assume without explicit mention that the reader is aware that we sometimes have to 
change bound variables when we use Gen and rule A4. 




(n) tr — r-t 

(o) t = r=>s-t = s-r 
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Proof 

(a) 1. 

t + 0 = t 

(S5') 

2. 

(t + 0 = t) =» {t + 0 = t =» t = t) 

(sn 

3. 

*** 

+ 

o 

li 

II 

1, 2, MP 

4. 

t = t 

1, 3, MP 

(b) 1. 

t = r =y (t = t => 7' = t) 

(SI') 

2. 

II 

ft 

4 

[[ 

1, tautology, MP 

3. 

t = r => r = t 

2, part (a), MP 

(c) I- 

r = t => (r = 5 =>• t — s) 

(SI') 

2. 

* 

Hi* 

ft 

* 

Hi* 

II 

4 * 

Part (b) 

3. 

t = r =y (r = 5 =4- t — s) 

1, 2, tautology, MP 

(d)l. 

r — t =+ (t = s => r = s) 

Part (c) 

2. 

t = s^(r = t=>r = s) 

1, tautology, MP 

3. 

s = t t — s 

Part (b) 

4. 

s = t => (r = t = 4 - r — s) 

2, 3, tautology, MP 


(e) Apply the induction rule to 8ft {z) :x=y=yx + z — y + z. 


(i) 1. 

x + 0 =x 

(S5') 

2. 

y + 0=y 

(S5') 

3. 

x = y 

Hyp 

4. 

x + 0=y 

1, 3, part (c), MP 

5. 

x + 0 = y + 0 

4, 2, part (d), MP 

6. 

hs x = + 0 = >> + 0 

1—5, deduction theorem 


Thus, h s 8ft(0). 


(ii) 1 . x = y =y x -h z = y + z 

2. x = y 

3. x + z! = {x + z)' 

4. y + z! = (y + z :)' 

5. x+z = y+z 

6. (x + z)' = (y + z)' 

7. x + z! — (y + z)' 

8. x + z! = y + d 

9. h s (x =y=> x + z — y + z) => 

(jc = ^=4^x + z / — + z') 


Hyp 

Hyp 

(S6') 

(S6 # ) 

1 , 2, MP 
5, (S2'), MP 

3, 6, part (c), MP 

4, 7, part (d), MP 

1-8, deduction theorem twice 


Thus, hs 8ft{z) => ^(z 7 ), and, by Gen, hs (Vz)(^?(z) =>■ ^'(z')). Hence, 
hs (Vz)^(z) by the induction rule. Therefore, by Gen and rule A4, 
hs t = r =$■ t + s = r + s. 

(f) Let 8ft(x) be x = 0 + x. 

(i) h s 0 = 0 + 0 by (S5'), part (b) and MP; thus, h s 8ft{0). 
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(ii) 1. x = 0 + x Hyp 

2. 0 + x' = (0 + x) f (S6') 

3. x' = (0 + x)' 1, (S2'), MP 

4. x! — 0 + x' 3, 2, part (d), MP 

5. hs x = 0 + x =+ x' = 0 + x' 14, deduction theorem 



Thus, h s 33(x) => 33(x') and, by Gen, hs (Vx) [ffi(x) 33(x')). So, by (i)^ 
(ii) and the induction rule, hs (Vx)(x = 0 + x), and then, by rule A4 
h§ t = 0 4" t. 


(g) Let 33 {y) be x' + y = ( x +y)'- 

(i) 1. x> + 0 = x' (S5') 

2. x + 0 = x (S5') 

3. (x + 0)' = x' 2, (S2'),MP 

4. x' + 0 = (x + 0/ 1, 3, part (d), MP 


Thus, h s 33(0). 

(ii) 1. x!+y=(x + y)' Hyp 

2. x? +y = (x / + y)' (S6') 

3. (x' + y)' = (x + y)" l, (S2'), MP 

4. x r + y' = (x + y ) " 2, 3, part (c), MP 

5. x + / = (x + y)' (S6') 

6. (x + y')' = (x + y)" 5, (S2'), MP 

7. x' +y = (x + y')' 4, 6, part (d), MP 

8. hs x / +T = (x + y) 1 7 } deduction theorem 

x' +y = (x +y/ 

Thus, hs 33 (y) =y 33 (y ; ) , and, by Gen, h s (Vy) (33 (y) =y 33(y')). Hence, by 
(i), (ii) and the induction rule, hs (Vy)(x' +y — (x + y)'). By Gen and rule 
A4, hs t 1 + r = (t + r)'. 

(h) Let 33(y) be x + y = y + x. 

(i) 1. x + 0 = x (S5') 

2, x = 0 + x Part (f) 

3. x+0=0+x l, 2, part (c), MP 


Thus, h s 33(0). 

(ii) 1. x+y =y+x 

2. x + y = (x + y)' 

3. y+x = (y + x)' 

4. (x + y)' = (y + x)' 

5. x + y' = (y + x) 7 

6. x + y = y + x 

1. hs x + y = y + x =+ 
x+y = y + x 


Hyp 

(S6') 

Part (g) 

1 , (S2'), MP 

2, 4, part (c), MP 
5, 3, part (d), MP 

1 -6, deduction theorem 
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Thus, h s &{y) =+ &&) and, by Gen, h s (Vy)(^Cy) => ^(/)). So, by (i), (ii) 
and the induction rule, h s (Vy)(x +y — T + x)- Then, by rule A4, Gen and 
rule A4, h s t + r = r + 1. 
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1. t = r=>t + s = r + s 

Part (e) 

2. t + s — s + t 

Part (h) 

3. r + 5 = sr + r 

Part (h) 

4. t — r 

Hyp 

5. t + 5 = r + s 

1, 4, MP 

6. s + 1 = r + s 

2, 5, (SP) MP 

7. s + t — s + r 

6, 3, part (c), MP 

8. h s t = r=>s + t = s + r 

1-7, deduction theorem 

Let #(z) be (x + y) 4- z = x + (y + z). 

(i) 1. (x + .y) + 0 = x +y 

(S5') 

2. y + 0=y 

(S5') 

3. x + (y + 0) = x + y 

2, part (j), MP 

4. (x + y) + 0 = x + (y + 0) 

1, 3, part (d), MP 

Thus, hs J'(O). 

(ii) 1. (x +y) + z =x + (y + z) 

Hyp 

2. (x + j>) +z! = ((x+y)+z)' 

(S6 ; ) 

3. ((x + y) + z)' = (x + 0 + z))' 

1, (S2'), MP 

4. (x + y)+z! = (x+ (y + z))' 

2, 3, part (c), MP 

5. y + z* = (y + z)' 

(S6') 

6. x + (y + z!) = x + (y + z/ 

5, part (i), MP 

7. x + (y + z/ = (x + (y + z))' 

(S6') 

8. x + (y + z 7 ) = (x + (y+z))' 

6, 7, part (c), MP 

9. (x + y) + 2 ! = x + (y + zf) 

4, 8, part (d), MP 

10 . h s (x T y) + z =x + (y + z) =*> 

1-9, deduction theorem 

(x + y) + z f = x + (y + z!) 


Thus, hs ${z) =+ M(z!) and, by Gen, hs (Vz)(^(z) =+ (^(z')). So, by (i), 
(ii) and the induction rule, hs (Vz)^(z), and then, by Gen and rule A4, 
hs (f + r) + j = ? + (/* + 5). 


Parts (k)-(o) are left as exercises. 


COROLLARY 3.3 

S is a theory with equality. 


Proof 

By Proposition 2.25, this reduces to parts (a) (e), (i), (k) and (o) of pro- 
position 3.2, and (S2'). 
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Notice that the interpretation in which: 

(a) the set of non-negative integers is the domain 

(b) the integer 0 is the interpretation of the symbol 0 

(c) the successor operation (addition of 1) is the interpretation of the ' 
function (that is, of //) 

(d) ordinary addition and multiplication are the interpretations of + and • 

(e) the interpretation of the predicate letter = is the identity relation 

is a normal model for S. This model is called the standard interpretation or 
standard model. Any normal model for S that is not isomorphic to the 
standard model will be called a non-standard model for S. 

If we recognize the standard interpretation to be a model for S, then, of 
course, S is consistent. However, this kind of semantic argument, involving 
as it does a certain amount of set-theoretic reasoning, is regarded by some as 
too precarious to serve as a basis for consistency proofs. Moreover, we have 
not proved in a rigorous way that the axioms of S are true under the 
standard interpretation, but we have taken it as intuitively obvious. For 
these and other reasons, when the consistency of S enters into the argument 
of a proof, it is common practice to take the statement of the consistency of 
S as an explicit unproved assumption. 

Some important additional properties of addition and multiplication are 
covered by the following result. 

PROPOSITION 3.4 

For any terms t, r,s, the following wfs are theorems of S. 

(a) t ■ (r + s) = (t • r) + (t • s) (distributivity) 

(b) (r + 5 ) ■ t = (r * t) + (5 • t) (distributivity) 

(c) (t • r) ■ s — t ■ (r • s) (associativity of ■) 

(d) t + s — r + s=yt = r (cancellation law for + ) 

Proof 

(a) Prove bs x ■ (y + z) = (x ■ y) + (x • z) by induction on z. 

(b) Use part (a) and Proposition 3.2(n). 

(c) Prove bs (x • y) ■ z = x ■ (y ■ z) by induction on z. 

(d) Prove bs x + z~y + z=yx=y by induction on z. This requires, for the 
first time, use of (S4'). 

The terms 0, O', 0", 0 W , , we shall call numerals and denote by 
0, 1, 2, 3, — More precisely, 0 is 0 and, for any natural number n , n + 1 is 
(7 j) / . In general, if n is a natural number, n stands for the numeral consisting 
of 0 followed by n strokes. The numerals can be defined recursively by 
stating that 0 is a numeral and, if u is a numeral, then u' is also a numeral. 



AN AXIOM SYSTEM 


PROPOSITION 3.5 

The following are theorems of S. 

(a) t +_1 = t 

(b) t-\ = t 

(c) t ’ 2 = t + t 

M) ^ T 5 = 0 =$■ £ = 0 A s = 0 

(e) t ^ 0 =4 >_(s • r = 0 => s = 0) 

(f) t + s — 1 = 4 - (t = 0 As =_1) V (r = 1 As — 0) 

(g) /•5 = I=4(^ = Ta.v = I) 

(h) t ^ 0 => (3y){t = y) 

(i) s ^ 0 =y (t • s = r • s => t = r) 

(j) , ^ 0 ^ (t 1 ^ (3y)(t = /)) 


Proof 


(a) 1 • t + 0' — {t 4- 0) 

2. f + 0 = t 

3. (r + 0)' = /' 

4. ^ + 0' = ^ 

5. t+\=t’ 

(b) U-0' = E0 + r 

2. t- 0 = 0 

3. f 0+f=0+ t 

4. t • 0' = 0 + t 

5. 0 + t = t 

6. t- O' = t 

1. t ' \ —t 

(c) 1. 1- (T)' = (t -T) + t 

2. t - 1 = t 

3. (t-T) + t = t + t 
4 .t.(l)' = t + t 

5. t • 2 = t T t 


(S6') 

(S5') 

2, (S2'), MP 

1, 3, Proposition 3.2(c), MP 
4, abbreviation 

(S8') 

(S7') 

2, Proposition 3.2(e), MP 

1, 3, Proposition 3.2(c), MP 
Proposition 3.2(f,b), MP 

4, 5, Proposition 3.2(c), MP 

6, abbreviation 

(S8') 

Part (b) 

2, Proposition 3.2(e), MP 

1, 3, Proposition 3.2(c), MP 
4, abbreviation 


(d) Let &(y) bex + y , = 0=4x = 0Ay’ = 0. It is easy to prove that h s ^(0). 
Also, since hs (* + y)' 7 ^ 0 by (S3') and Proposition 3.2(b), it follows by 
(S6') that h s x + /^0. Hence, hs^(/) by the tautology 
-1 A => (A B). So, hs @{y) =4=- ^(/) by the tautology A =4 (B =4- A). 
Then, by the induction rule, hs (\/y)&(y) and then, by rule A4, Gen and 
rule A4, we obtain the theorem. 

(e) The proof is similar to that for part (d) and is left as an exercise. 

(f) Use induction on y in the wf x +y = 1 = 4 * ((x = 0 At = 1)V 

(x = 1 A >> — 0)). 

(g) Use induction on y in x • y = 1 = 4 - (x = 1 A y = 1). 




(h) Perform induction on x in x y 0 => (3w) (x = w'). 

(i) Let &(y) be (Vx)(z y 0 => (x-z = y-z=^ x = j>)). 

(i) 1. z y 0 Hyp 


2. x • z = 0 • z 

Hyp 

3. 0 - z = 0 

Proposition 3.2(1) 

4. x ■ z = 0 

2, 3 Proposition 3.2(c), MP 

5. x = 0 

1, 4, part(e), MP 

6. bs z y 0 =?► (x-z = 0- z=>x = 

0) 1-5, deduction theorem 

7. b s (Vz)(z y 0 (x • z = 0 - z 

6, Gen 

=» x = 0)) 

Thus, b s ^(0). 

(ii) 1 . (Vx)(z y 0 =*• (x - z = y-z => x 

=y)) Hyp (*(y)) 

2. z y 0 

Hyp 

3 . x • z = y ■ z 

Hyp 

4. y y 0 

(S3'), Proposition 3.2(b), MP 

5. y ■ z y o 

2, 4, part (e), a tautology, MP 

6. x ■ z y 0 

3, 5, (SI'), tautologies, MP 

7. x y o 

6, (S7'), Proposition 3.2(o,n), 
(SI'), tautologies, MP 

8. (3w)(x - n/) 

7, part (h), MP 

9 . x = b* 

8, rule C 

10. b' -z = y • z 

3, 9, (A7), MP 

11. b-z + z = y'z + z 

10, Proposition 3.2(m,d), MP 

12. b • z = y • z 

11, Proposition 3.4(d), MP 

13. z^0=>(b‘z — y-z=>b = y) 

1, rule A4 

14. b-z=y-z=yb = y 

2, 1 3, MP 

15 . b^y 

12, 14, MP 

16 . # =y 

1 5, (S2'), MP 

17 . x =y 

9, 16, Proposition 3.2(c), MP 

18. ^(y),z y 0,x • z = y • z bs x — 

y 1-17, Proposition 2.10 

19. ^(y) bs z y 0 => 

1 8, deduction theorem twice 

(x • z = y • 2 => x = y ) 

20. 3l(y) bs (Vx)(z y0=^ 

19, Gen 

(x • z = y • z =>• x = y)) 

21 . h-s 0 (y) =5- »{/) 

20, deduction theorem 


Hence, by (i), (ii), Gen, and the induction rule, we obtain h§ (Vy)&(y) 
and then, by Gen and rule A4, we have the desired result. 

(j) This is left as an exercise. 

PROPOSITION 3.6 


(a) Let m and n be any natural numbers. 
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(i) If m 7^ n, then Fg m ^n. 

(ii) l— § m + n = m T n and Eg m~n = m -n. 

(b) Any model for S is infinite. 

(c) For any cardinal number ftp, S has a normal model of cardinality ftp. 


Proof 

(a)(i) Assume m n. Either m < n or n < m. Say, m < n. 
\.rn — n . Hyp 

m times 


n tunes 


2. 0" . . .' = O'" . . .' 


1 is an abbreviation of 2 

n — vi times 


3. Apply (S4') and MP m times in a row. We get 0 = 0"../- Let t be 
n — m — 1. Since n > m, n — m — 1^0. Thus, we obtain 0 = t'. 

(S3') 

5. 0 = t 1 A 0 7 ^ t 1 3, 4, conjunction introduction 

6. m = n\-s0 = t , A0^t } 1-5 

7. hg in 7 ^ n 1-6, proof by contradiction 


A similar proof holds in the case when n < m. (A more rigorous proof can 
be given by induction in the metalanguage with respect to «.) 

(ii) We use induction in the metalanguage. First, m + 0 is m. Hence, 
|~s m + 0 = m + 0 by (S5'). Now assume hg m + n = m + n. Then 
b s (m + n)' = m + ( n )' b y (S2') and (S6'). But m + (n + l) is (m + n)' and 
n+ 1 is (riy. Hence, hg m + ( n + 1) = m + n + 1 . Thus, Fs rrT+n = m + «. 
The proof that Fg m • n = m ■ n is left as an exercise. 

(b) By part (a), (i), in a model for S the objects corresponding to the nu- 
merals must be distinct. But there are denumberably many numerals. 

(c) This follows from Corollary 2.34(c) and the fact that the standard model 
is an infinite normal model. 


An order relation can be introduced by definition in S. 


DEFINITIONS 

t < s for (3w)(w = 

for t < sV t — s 
t > s for s < t 
t^s for sjgT 

t £ s for -i {t < 5), and so on 

In the first definition, as usual, we choose w to be the first variable not in 
t or s. 
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PROPOSITION 3.7 

For any terms the following are theorems. 


(a) t ft t 

(b) t < s => (5 < r =$■ t < r) 

(c) t < s => s ft t 

(d) t<s4$t + r<s + r 

(e) t^t 

(f) t^s=> t^r) 

(g) t^s4$t + r^s + r 

(h) => (5 < r => t < r) 

(i) O^t 

Q 0<t > 

(k) t <r 4$ t*^r 

(l) t^r4^t<i J 

(m) t < t 

(n) 0 < 1, 1 < 2, 2 < 3, . . . 

Proof 

(a) l. t < t 

2, (3w)(w ^OA w + t ~t) 

3, b^GAb + t= t 

4, b T" t ~ t 

5, t = 0 + f 

6, Z> + 1 = 0 + 1 
1 . 6 = 0 

8. 6/ 0 

9 .^ = 0A^0 

10. 0 = 0 AO ^0 

11. f <fh s 0 = 0A0^0 

12. Fs t ft t 

(b) 1, t < s 

2. s <r 

3. (3w)(w /0Aw + ( = ^) 

4. (3y)(y ^0 Ap + s = /‘) 

5. b^OAb + t — s 

6. cjtOAc+s — r 

7 . b 1 = s 

8. c 3- s = r 

9. c + (6 + f) = c + ^ 

10. c 3- (6+ 0 = r 

11. (c + b) + t = r 

12. A 7^ 0 


(o) t ^ r [t < r\! r < t) 

(p) t = r\J t <r\l r <t 

(q) 

(r) f + 

(s) r ^ 0 =>- f + r > t 

(t) r ft 0 =?► t - r ^ t 

(u) r/0«»'>0 

(v) r > 0 =>- (t > 0 =>■ r ■ t > 0) 

(w) r ^ 0 => (f > 1 f ■ r > r) 

(x) 7‘ ^ 0 =3- (f < s 34* f ■ r < 5 • r) 

(y) r ^ 0 (f^s f • • r) 

(z) t ft 0 

(z?) t^r A r^t => t = 7 * 


Hyp 

1 is an abbreviation of 2 

2, rule C 

3, conjunction rule 
Proposition 3.2(f) 

3, 4, Proposition 3.2(c), MP 

6, Proposition 3.4(d), MP 
3, conjunction elimination 

7, 8, conjunction elimination 
9, tautology: B A ->B => C, MP 
1-10, Proposition 2.10 

1-11, proof by contradiction 

Hyp 

Hyp 

1 is an abbreviation of 3 

2 is an abbreviation of 4 

3, rule C 

4, rule C 

5, conjunction elimination 

6, conjunction elimination 

7, Proposition 3.2(i), MP 

9, 8, Proposition 3.2(c), MP 

10, Proposition 3.2(j,c), MP 
5, conjunction elimination 
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13. c T* b 7^ 0 

14. c + b^0A(c-\-b) + t = r 

15. (3 u)(u ^ 0 A u 4 - 1 = r) 

16. t < r 

17. Ps t < s < r =s- f < r) 
parts (c)-(z / ) are left as exercises. 


12, Proposition 3.5(d), 
tautology, MP 

13, 11, conjunction introduction 

14, rule E4 
Abbreviation of 15 
1-15, Proposition 2.10, 
deduction theorem 


PROPOSITION 3.8 

(a) F or any natural number k, h s x = 0 V . . . V x = k 4$ x ^ k. 

(a') For any natural number k and any wf 38, F s 38(0) A ^(1) A . . . A38(k) 
(Vx)(x^A: => 38(x)). 

(b) F or any natural number k > 0, Ps x — 0 V . . . V x = (*- 1) 4$x<k 
(b') For any natural number k > 0 and any wf 38, F s ^(0) A ^(1) A . . . 

f\3fi(k — 1) (Vx)(x < k => ^?(x)). 

(c) Ps ((Vx)(x < y =4- *(x)) A (\/x)(x^y => <?(x))) => (Vx)(^(x) V ^(x)) 

Proof 

(a) We prove Ps x = 0 V . . . Vx = k 45 x^k by induction in the metalan- 
guage on k. The case for k = 0, Ps x = 0 45 x ^ 0, is obvious from the defi- 
nitions and Proposition 3.7, Assume as inductive hypothesis F s x = 0 V . . . 
Vx = k x ^ k. Now assume x = 0V...Vx = /cVx = k 4- 1 . But Ps x = 
k+ 1 x ^ A: + 1 and, by the inductive hypothesis, P s x = 0V...Vx = 
k =5 x^k. Also Psx^A:=4-x^A:+ 1. Thus, x^k-\-l. So, Psx = 0V... 
Vx = /cVx = A:+l =5 x^k -T l . Conversely, assume x^k + 1 . Then 
x — £+lVx<&+l. If x = k + 1 , then x = 0V...Vx = AVx = A+l. If 
x < k + 1, then since k 4- 1 is (/c/, we have x^/c by Proposition 3.7(1). By the 
inductive hypothesis, x = 0 V . . . V x = k, and, therefore, x = 0 V . . . V x = 
A: V x = A: -f 1 . In either case, x = 0V...Vx = AVx = A+ 1 . This proves 
Ps x^ A - + 1 =>x = 0V...Vx — A:Vx = A; + 1. From the inductive hypoth- 
esis, we have derived Pgx = 0V...Vx = A: + 1 45 x^k + l and this com- 
pletes the proof. (This proof has been given in an informal manner that we 
shall generally use from now on. In particular, the deduction theorem, the 
eliminability of rule C, the replacement theorem, and various derived rules 
and tautologies will be applied without being explicitly mentioned.) 

Parts (a'), (b), and (b') follow easily from part (a). Part (c) follows almost 
immediately from Proposition 3.7(o), using obvious tautologies. 

There are several stronger forms of the induction principle that we can 
prove at this point. 
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PROPOSITION 3.9 

(a) Complete induction. E s ( Vx) ( (Vz) (z <x => &(z)) -> &(x)) =>• (Vx)^(A 
In ordinary language, consider a property P such that, for any x, if p 
holds for all natural numbers less than x, then P holds for x also. Then P 
holds for all natural numbers. 

(b) Least-number principle, Es (3x)&(x) => (3j) (Sd(y) A (Vz) (z <y=> 

If a property P holds for some natural number, then there is a least 
number satisfying P. 


Proof 


(a) Let r A(x) be ( Vz) (z ^ x => £%i(z)). 

(i) 1. (Vx)((Vz)(z < x => $ (z) ) £A{x)) 

2. (Vz)(z < 0 => &{z)) =>■ &(0) 

3. z ^ 0 

4. (Vz)(z < 0 => ^?(z)) 

5. <0(0) 

6. (Vz) (z ^ 0 ^ ^(z)) i.e., tf(0) 

7. (Vx)((Vz)(z < x ^(z)) 

=> ^(x)) hs r m 

(ii) 1. (Vx)((Vz)(z < x => ^(z)) => ^(x)) 

2. r <T(x), i.e., (Vz)(z^x ^ #(z)) 

3. (Vz)(z < x' $ (z)) 

4. (Vz)(z <x' => &(z)) => #(x') 

5. ^(x') 

6. z <x^ => z <x / Vz = x / 

7. z < x' => 0H{z) 

8. z = x* => ^(z) 


9. (Vz)(z^x' ^?(z)) i.e., ^(x') 

10. (Vx)((Vz)(z < x => ^?(z)) => ^(x)) 
h s (Vx)(^(x) =4^ ^(x')) 


Hyp 

1, rule A4 
Proposition 3.7(y) 

3, tautology, Gen 

2, 4, MP 

5, Proposition 3.8(a') 

1-6 

Hyp 

Hyp 

2, Proposition 3.7(T) 

1, rule A4 

3, 4, MP 

Definition, tautology 
3, rule A4 

5, axiom (A7), Proposition 
2.23(b), tautologies 

6, 7, 8, Tautology, Gen 

1-9, deduction theorem, Gen 


By (i), (ii) and the induction rule, we obtain Q) Es (Vx) r A(x), that is, 
Es (Vx) (Vz) (z s^x => ^(z)), where S’ is (Vx)((Vz)(z < x => ^(z)) => ^(x)). 
Hence, by rule A4 twice, ^ Es x^x ==> ${x). But E s x^x. So, S> Es ^(x), 
and, by Gen and the deduction theorem, Es S> => (Vx)^(x). 


(b) 1. ->(3 \y){@{y) A (Vz)(z < y => &(z))) Hyp 

2. (Vy)~>(&(y) A (Vz)(z < p => -i^(z))) 1, derived rule for negation 

3. ( Vy) ( (Vz) (z < y => -i^(z)) ~i &(y)) 2, tautology, replacement 

4. (Vv)-i^(y) 3, part (a) with instead of ^ 

5. — i(3> , )^ i, (>') 4, derived rule for negation 
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6. ->(3x)$(x) 

7. bs -(3j)(^(y)A(Vz)(z<T=* 

- i ^( z ))) -^(3x)&(x) 

8. b s (3x)^(x) ^ (3y){m{y)A 
(Vz)(z <T ^ ^(z))) 


5, change of bound variable 
1-6, deduction theorem 
7, derived rule 
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Exercise 

3.1 ( Method of infinite descent) 

Prove bs (Vx)(^(x) => (3y)(y < x A ^(y))) =4- (Vx)-i^(jc) 

Another important notion in number theory is divisibility, which we now 
define. 


DEFINITION rjs for (3z) (s = t • z). (Here, z is the first variable not in t or 5 .) 

PROPOSITION 3.10 

The following wfs are theorems for any terms t,s,r. 

(a) t\t 

(b) 

(c) r |0 

(d) t\s A s|r => t\r 

(e) s 0 At\s =5* t^s 

(f) f As\t s = t 

(g) t\s^t\(r-s) 

(h) t\s At\r r |(5 + r) 

Proof 

(a) t ~ t • 1. Hence, f|f. 

(b) t = 1 * t. Hence, 1 |f. 

(c) 0 — t • 0. Hence, f[0. 

(d) If s = t • z and r = s ■ w, then r = t ■ (z ■ w). 

(e) If s 0 and r |s, then s — t-z for some z. If z — 0, then 5 = 0. Hence, 

z / 0. So, z = \I for some u. Then s — t • ( 1 /) = t • « + t^t. 

(f) -(h) These proofs are left as exercises. 


Exercises 

3.2 Prove bs f j 1 =$■ t = 1. 

3.3 Prove bs (f |5 A f 1^) 1 = 1 . 
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It will be useful for later purposes to prove the existence of a unique 
quotient and remainder upon division of one number x by another nonzero 
number y. 

PROPOSITION 3.11 

hs y 7 ^ 0 (3m) (3y)[x = y-u + vAv<yA (Vmi) (Vtfi) ((x = y ■ u\ + vyA, 

v i < y) =>- u = Mi A v = Ui )] 


Proof 

Let &(x) be y ^ 0 => (3w)(3t')(jt — y'U + vAv<y). 


(i) l.y^O 

2. 0 = y • 0 + 0 

3. 0 < y 

4. 0 = y*0 + 0A0<y 

5 . ( 3 m ) ( 3 u ) (0=y-u + vAv <y) 

6. y ^ 0 => ( 3 m )( 3 i ;)(0 = y ■ u + v 

A v <y) 

(ii) l. #(jc) i.e., y ^ 0 => (3m) (3u) 

(jc = y ■ u + v A v < y) 

2.y^0 

3. ( 3 m )( 3 m)(jc = y-u + vAv<y) 

4. x = y a + b Ab < y 

5. b <y 

6. b’ ^y 

7. V <y\I V ~y 


11 . b 1 =y=> (x l —y- (a - f 1) 

+ 0 A 0 < y) 

12. b 1 = y => (3n)(3r)(x:' = y • m + 
v A v < y) 

13. (3m)(3i;)(.x:' = y • u 4- v Av <y) 

14. #(*) =* (y ± 0 =* (3m) (3m) 

{x r —y-u+v Av < y)) 

i.e., ${x) => ^( x ’ ) 


Hyp 

(S5') 9 (S70 

1, Proposition 3.7(t) 

2, 3, conjunction rule 
4, rule E4 twice 

1-5, deduction theorem 

Hyp 

Hyp 

I, 2, MP 

3, rule C twice 

4, conjunction elimination 

5, Proposition 3.7(k) 

6, definition 

4, (S6'), derived rules 

8, rule E4, deduction theorem 
4, (S6'), Proposition 3.5(b) 

10, Proposition 3.4, 2, 
Proposition 3.7(t), (S5') 

I I , rule E4 twice, deduction 
theorem 

7, 9, 12, disjunction elimination 
1-13, deduction theorem 


8. b f < y =$■ (x f — y • a + V A b' < y) 

9. b' < y (3M)(3r)(x' = y ■ u 
4- v A v < y) 

10. V = y => x r = y • a 4- y • 1 


By (i), (ii), Gen and the induction rule, hs (Vx)^(x). This establishes the 
existence of a quotient u and a remainder v. To prove uniqueness, proceed 
as follows. Assume y ^ 0. Assume x=y-u + v Av <y and x — y ■ u\-y 
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p. /\v\ < y. Now, u = u\ or u < it] or u\ < u. If u = u i, then v — vi by 
proposition 3.4(d). If u < ui, then u\=u + w for some w/0. Then 
, . u _|_ v = y ■ (11 + vr) + vi = y ■ u -p y • w 4- V \ . Hence, v = y-w + v i. Since 
P 0,y • w^y. So, v = y ■ w\ V\ ^ y , contradicting v < y. Hence, u u\. 
Similarly, u\ u. Thus, u = u\. Since y- u ■+- v = x = y • ui + v\, it follows 
that v = vi . 

From this point on, one can generally translate into S and prove the 
results from any text on elementary number theory. There are certain 
number-theoretic functions, such as x y and x!, that we have to be able to 
define in S, and this we shall do later in this chapter. Some standard results 
of number theory, such as Dirichlet’s theorem, are proved with the aid of the 
theory of complex variables, and it is often not known whether elementary 
proofs (or proofs in S) can be given for such theorems. The statement of 
some results in number theory involves non-elementary concepts, such as 
the logarithmic function, and, except in special cases, cannot even be for- 
mulated in S. More information about the strength and expressive powers of 
S will be revealed later. For example, it will be shown that there are closed 
wfs that are neither provable nor disprovable in S, if S is consistent; hence 
there is a wf that is true under the standard interpretation but is not pro- 
vable in S. We also will see that this incompleteness of S cannot be attrib- 
uted to omission of some essential axiom but has deeper underlying causes 
that apply to other theories as well. 

Exercises 

3.4 Show that the induction principle (S9) is independent of the other 
axioms of S. 

3.5° 

(a) Show that there exist non-standard models for S of any cardinality N a . 

(b) Ehrenfeucht (1958) has shown the existence of at least 2 Ko mutually non- 
isomorphic models of cardinality N a . Prove the special case that there 
are 2 ! '° mutually non-isomorphic denumerable models of S. 

3.6 d Give a standard mathematical proof of the categoricity of Peano’s 
postulates, in the sense that any two ‘models’ are isomorphic. Explain why 
this proof does not apply to the first-order theory S. 

3.7° (Presburger, 1929) If we eliminate from S the function letter / 2 2 for 
multiplication and the axioms (S7) and (S8), show that the new system S + is 
complete and decidable (in the sense of Chapter 1, p. 34). 

3.8 

(a) Show that, for every closed term t of S, we can find a natural number n 
such that hs t = n. 

(b) Show that every closed atomic wf t = s of S is decidable — that is, either 
hs t = s or Fs t 7^ s. 

(c) Show that every closed wf of S without quantifiers is decidable. 



FORMAL NUMBER THEORY 


3.2 NUMBER-THEORETIC FUNCTIONS AND RELATIONS 

A number-theoretic function is a function whose arguments and values are 
natural numbers. Addition and multiplication are familiar examples of 
number-theoretic functions of two arguments. By a number-theoretic rela- 
tion we mean a relation whose arguments are natural numbers. For example 
= and < are binary number-theoretic relations, and the expression 
x + y < z determines a number-theoretic relation of three arguments.! 
Number-theoretic functions and relations are intuitive and are not bound up 
with any formal system. 

Let K be any theory in the language 5£a of arithmetic. We say that a 
number-theoretic relation R of n arguments is expressible in K if and only if 
there is a wf 3$(x\ , . . . , x „ ) of K with the free variables xi, . . . ,x„ such that, 
for any natural numbers ki , . . . ,k„, the following hold: 

1 . If R(£i , . . . , k„) is true, then hj£ ffl{k \ , . . . , k n ). 

2. If R(&i , . . . , k„) is false, then hj^ -i 0&{k \ , . . . , k„). 

For example, the number- theoretic relation of identity is expressed in S 
by the wf x i = * 2 . In fact, if k\ = & 2 , then k\ is the same term as /C 2 and so, by 
Proposition 3.2(a), h s k\ — hi. Moreover, if k\ f kj, then, by Proposition 
3.6(a), hs k\ f k 2 . 

Likewise, the relation ‘less than’ is expressed in S by the wf x\ < xj. Recall 
that x\ < A '2 is ( 3 X 3 ) (*3 f 0 A A 3 + xi = A 2 ). If k\ < then there is some 
non-zero number n such that ki = n-yk\. Now, by Proposition 3.6(a)(ii), 
hs /f 2 = h + k\. Also, by (S3'), since n f 0, h s h f 0. Hence, by rule E4, one 
can prove in S the wf (3w)(w + =k 2 ); that is, hs k\ < fo. On the 

other hand, if k\ fk 2 , then ^2 < k\ or k 2 =k\. If ^2 < k\, then, as we have 
just seen, hs /C 2 <k\. If k 2 = k\, then hs /f 2 = k\. In either case, hs k 2 ^ k\ 
and then, by Proposition 3.7(a,c), h s k\ f k 2 . 

Observe that, if a relation is expressible in a theory K, then it is ex- 
pressible in any extension of K. 


Exercises 

3.9 Show that the negation, disjunction, and conjunction of relations that 
are expressible in K are also expressible in K. 

3.10 Show that the relation x + y — z is expressible in S. 


!We follow the custom of regarding a number-theoretic property, such as the 
property of being even, as a ’relation’ of one argument. 
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Let IC be any theory with equality in the language a of arithmetic. A 
number-theoretic function / of n arguments is said to be representable in IC 
if and only if there is a wf 08{x u-..,x n ,y) of K with the free variables 

such that, for any natural numbers k \ the following 

hold: 

1. If f{k\, • • . ,K) = m ± then h K @(k u . . . ,k n) m ). 

2. I~k (3i 

If, in this definition, we replace condition 2 by 

2'>k (3i 

then the function f is said to be strongly representable in K. Notice that 2' 
implies 2, by Gen and rule A4. Hence, strong representability implies re- 
presentability. The converse is also true, as we now prove. 


PROPOSITION 3.12 (V.H. DYSON) 

If /(jci, . . . , x „ ) is representable in K, then it is strongly representable in IC. 


Proof 

Assume / representable in K by a wf 33{x \, . . . ,x„,y). Let us show that / is 
strongly representable in K by the following wf ^(jq, . . . ,x ft ,y) : 

(piy)^(xi, . . - ,x„,y)] A &(x h . . . ,x„,y)) V (->[(3 \y)3S{x \ t . . . Ay = 0) 

1. Assume k„) = m. Then t-j^ ^(/q, . . . and (3iy) 

@(k u • • • ,/fnj). So, by conjunction introduction and disjunction introduc- 
tion, we get <g(k u ... ,k n ,m ). 

2'. We must show (3i y)^(x \ y . . . ,x„,y ). 

Case 1. Take (3[y)^(x[ 1 . . . ,x n ,y) as hypothesis, (i) It is easy, using rule 
C, to obtain &(xi,...,x „,b) from our hypothesis, where b is a new in- 
dividual constant. Together with our hypothesis and conjunction and dis- 
junction introduction, this yields ^(jq, . . . , x, n b) and then, by rule E4, 
(3y)^(jq , . . . , x„ , y) . (ii) Assume %>(x \ , . . . , x„, u ) A c &(x \ , . . . , x n , v). From 

and our hypothesis, we obtain 88 {x \, . . . ,x„, u), and, from 
^(xi , . . . ,jc„, v) and our hypothesis, we obtain ^?(jq, . . . y x n , v). Now, from 
^(jq, . . . ,x„, u) and &(x \, . . . ,x„, v) and our hypothesis, we get u — v. The 
deduction theorem yields ^(jq, . . . ,x„, n) A ^(x \ , . . . ,x„, v) => u = v. From 
(i) and (ii), (3 1 y) < ^’(vi, . . . , x„ , y) . Thus, we have proved (3jy) &(xi , . . . , 
x„,y) =» (3iy)^(xi, . . . ,x ny y). 

Case 2. Take -<(3iy)&(xi y . . . ,x fn y) as hypothesis, (i) Our hypothesis, 
together with the theorem 0 = 0, yields, by conjunction introduction, 
->(3i y)38{xi, . . . , x n , y) A 0 = 0. By disjunction introduction, ^(jq, . . . , x „ , 0), 



FORMAL NUMBER THEORY 


172 


and, by rule E4, {By)^(x\,. . . ,x n ,y)- (ii) Assume ^(xi, . . . ,jc Hi w)a 
^(x\ ,x„, v). From &(xi, ... ,x„,u) and our hypothesis, it follows easily 
that u = 0. Likewise, from ^(x \ , . . . , x„, v) and our hypothesis, v = 0. Hence" 
u = v. By the deduction theorem, &(xi , . . . , x m u) A ^(xi, . . . v) 

From (i) and (ii), (3iy) < %(x\,... t x„ i y). Thus we have proved ^(3iy) 
..,x„,y)=> (Biyfflxi, . . . ,x„,y) 

By case 1 and case 2 and an instance of the tautology [(£)-$, 
E ) A (-I D => E)\ =$■ E, We can obtain bj^ (B\y)<g(xu . . . ,x„,y). 

Since we have proved them to be equivalent, from now on we shall use 
representability and strong representability interchangeably. 

Observe that a function representable in K is representable in any ex- 
tension of K. 

Examples 

In these examples, let K be any theory with equality in the language <£ A , 

1. The zero function, Z(x) = 0, is representable in K by the wf jq = 
x\ A y = 0. For any k and m, if Z(k) = wz, then in = 0 and 

k = k A 0 = 0; that is, condition 1 holds. Also, it is easy to show that 
l~K (3ij>)(xi =xi A y = 0). Thus, condition 2' holds. 

2. The successor function, N(x) = x 1 , is representable in K by the wf 

y = aq. For any k and m , if N(k) = m, then m = k + 1; hence, in is k . Then 
l~K in = k . It is easy to verify that bj^ — x x ). 

3. The projection function, UJ(x\, . . . ,x„) = xj, is representable in K by 
x\ = xj A X 2 = X 2 A . . . A x„ = x„ A y = Xj. If ,...,&„) = w, then m = 
kj. Hence, bj^ k\ = k\ A k 2 = &2 A . . . A k„ = k„ A m = kj. Thus, condi- 
tion 1 holds. Also, bj£ (3iy)(xi = .jq A *2 = *2 A . . . A = x„ Ay ~ xj), 
that is, condition 2! holds. 

4. Assume that the functions g(x \ , . . . ,x,„),hi(x\ ,.. . ,x„), . . ., h m (x | , . . . ,x„) 

are strongly representable in the theory with equality K by the wfs 
^(^■i t ■ - • j Xm i Z)) (xj , ... 3 Xfj , y \ ) , ... j&fn(x\) ... , Xu , yjji ) , respectively. 

Define a new function / by the equation 

/(^l 5 ■ ■ " ${,k 1 (^1 5 ■ ■ ' 5 Xji) , . ■ . , hfll (xj , - . * 5^)] 

/is said to be obtained from g,hy, . . . ,h m by substitution. Then /is also 
strongly representable in K by the following wf £t>(x \, . . . ,x„ f z): 

(3yi ) ■ ■ ■ (By tn ) (a'i , . . . , x n , ) A . . . A SS tn (q , . . . , x n , y)n ) A *&(yi j • ■ • , y'm . z)) 

To prove condition 1, let f(k\,... ,k„) = p. Let hj (k\ ,... ,k„) = rj for 
1 ^j^m; then g(r\ , . . . , r,„) = p. Since ^ , . . . , represent g 7 hi,... f 
h m , we have bj^ $j{k\,. . . for 1 and bj^ ^(n, . . . ,T m ,p). 

So by conjunction introduction, bj^ $i(k \, . . .,k„,n) A . . . A (ki , . . . , 

kn,r m )k&(zi ■ ■ -7- m ,p). Hence, by rule E4 , bjA S){k\^. . . , k„,p). Thus, 
condition 1 holds. Now we shall prove condition 2’ . Assume <$(x \, . . . , 
x„, u) A c Jl{x\ , . . . , x „ , v), that is 
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(A)(3yj) . . . (3y /w ) {j$\ {xi , . . . , Xfj , y \ ) A ... A ^/m(xj, . . . , Xjj > Vm ) A ^(vi, - - ■ w)) 

and 

( D ) (3yi ) • • ■ (3y/n ) i (xi 5 ■ • • i *U i jVi ) A ... a (xi , . . . , Xu 1 \'m ) a &(yi , ■ ■ • , » U ) 

By(A), using rule C m times, 

(xi , . . . ,x„, b\ ) A . . . A 38 m (xi , . . . ,x„, b m ) A #(Z>i, . . . , b mf u ) 

By(D) using rule C again, 

. . . ,x„,ci) A .. . A . . , x„ , c M ) A #(ci, . . . ,c,„,u) 

Since bj^ (E?uy).$$y(xi , . . . , x„ , yj ) , we obtain from . . ,x„,bj ) and 

. . . ,x„,cy), that bj — cj. From ^(6i, . . . ,b m it) and hi=ci,..., 
= c„„ we have ^{c\ ,c m , u). This, with bj^ (3iz)^(xi, . . . ,x„,z) and 
^(ci , . . . , c m , v ) yields u = v. Thus, we have shown bj^ S>(xi , . . . , 
x,„ it) A S>(xi , . . . ,x„, v) => w = v. It is easy to show that bj ^(3z)£0 
(xi, . . . ,x„,z). Hence, b^ (3iz)^(xi, . . . ,x„,z). 


Exercises 

3.11 Let K be a theory with equality in the language Jt? a- Show that the 
following functions are representable in K. 

(a) Zji(xi) . . . yXfj) 0 [hiwl .Zj](x i, . . . ^x f1 ) Z{CJ-^ (xj, . . . ,x n )).] 

(b) C^(xi, . . . ,x„) = k, where A; is a fixed natural number. [Hint: Use 
mathematical induction in the metalanguage with respect to A:.] 

3.12 Prove that addition and multiplication are representable in S. 

If R is a relation of n arguments, then the characteristic function Cr is 
defined as follows: 

, . f 0 if R(xi, . . .x„) is true 

M * 1 ’ ' ' ' ' Xn) ~ \ 1 if R(x i , . . . ,x„) is false 


PROPOSITION 3.13 

Let K be a theory with equality in the language such that bj^ 0^1. 
Then a number-theoretic relation R is expressible in K if and only if Cr is 
representable in K. 


Proof 

If R is expressible in K by a wf R8(x \, . . . ,x„), it is easy to verify that Cr is 
representable in K by the wf (@}(x \ , . . . ,x„) A y — 0) V (~>R3{xi , . . . ,x„) 
AT = 1). Conversely, if Cr is representable in K by a wf ^(xi, . . . ,x„, y). 
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then, using the assumption that 0^1, we can easily show that R i s ,® 
expressible in K by the wf &(xi, . . . ,x„, 0). 


Exercises 

3.13 The graph of a function f{x is the relation 
/(xi,... ,x n ) = x„ + i. Show that f(x i, . . . ,x„) is representable in S if and only 
if its graph is expressible in S. 

3.14 If i Q and R are relations of n arguments, prove that C no t-R = 1 — CY 
C{q or R) = C Q ' Cr, and C(g an dR) = Cq + C R — Cq • C R . 

3.15 Show that f(x \ , . . . , x„) is representable in a theory with equality K in 
the language SR a if and only if there is a wf RS{x\ ,... ,x„,y) such that, for any 

if f(ku ... ,k„) = m, then (Vy)(#(£i, . . . ,k„,y) <^y — m). 

3.3 PRIMITIVE RECURSIVE AND RECURSIVE FUNCTIONS 


The study of representability of functions in S leads to a class of number- 
theoretic functions that turn out to be of great importance in mathematical 
logic and computer science. 


DEFINITION 

1. The following functions are called initial functions. 

(I) The zero function, Z(x) = 0 for all x. 

(II) The successor function, N(x) — x + 1 for all x. 

(III) The projection functions, U?{x \, . . . ,x n ) — x/ for all jq, . . . ,x„. 

2. The following are rules for obtaining new functions from given functions. 

(IV) Substitution: 

f{X\y . . • yX n ) 0(.hl(x lj . - - yX/jy . . . ,h,y |(xi, . . . )X n )) 

/ is said to be obtained by substitution from the functions 

> • • • )hn)» h\ (.Xj , ■ . . yX n ), . . . , hm{x\ > - ■ • jX t j 

(V) Recursion: 

f {X\ y ■ ■ ■ y Xfl y 0) (j(X{ y ... y Xjj 

fix 1, • ■ • ,x„,y + 1 ) = h(x 1, . . . ,x„,y,f(x i, . . . ,x„,y)) 

Here, we allow n — 0, in which case we have 

/( 0) = k where A: is a fixed natural number 

fiy + 1) =Ky,f{y)) 
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We shall say that / is obtained from g and h (or, in the case n = 0, from 
h alone) by recursion. The parameters of the recursion are jq, . . . ,x„. 
Notice that f is well defined: f(x i, . . . ,x fn 0) is given by the first equa- 
tion, and if we already know f(xi, . . . ,x„,y), then we can obtain 
f{x \, . . . ,x„,y + 1) by the second equation. 
t(VI) Restricted p-Operator. Assume that g(xi , . . . ,x„,y) is a function such 
that for any jq , . . . , jc„ there is at least one y such that 
g(xi , . . . ,x„,y) = 0. We denote by py(g(x i , . . . ,x„,y) = 0) the least 
number y such that g(xi , . . . ,jq„y) = 0. In general, for any relation 
R(xi , . . . , x „ , y ) , we denote by pyR(xi , . . . ,x, u y) the least y such that 
R(xi, . . . »,y) is true, if there is any y at all such that R(jq, . . . ,x fl ,y) 

holds. Let f{x \, . . . ,x„) = fiy(g(x i, . . . ,x„,y) = 0). Then f is said to be 
obtained from g by means of the restricted ^-operator if the given 
assumption about g holds, namely, for any jq, . . . , x „ , there is at least 
one y such that g(x i, . . . = 0. 

3. A function / is said to be primitive recursive if and only if it can be 

obtained from the initial functions by any finite number of substitutions 
(IV) and recursions (V) - that is, if there is a finite sequence of functions 
/ 0 , . . . ,f„ such that f„ — f and, for either f\ is an initial function 

or fj comes from preceding functions in the sequence by an application of 
rule (IV) or rule (V). 

4. A function / is said to be recursive if and only if it can be obtained from 
the initial functions by any finite number of applications of substitution 
(IV), recursion (V) and the restricted /^-operator (VI). This differs from 
the definition above of primitive recursive functions only in the addition 
of possible applications of the restricted p -operator. Hence, every pri- 
mitive recursive function is recursive. We shall see later that the converse 
is false. 

We shall show that the class of recursive functions is identical with the 
class of functions representable in S. (In the literature, the phrase ‘general 
recursive’ is sometimes used instead of ‘recursive’.) 

First, let us prove that we can add ‘dummy variables’ to and also permute 
and identify variables in any primitive recursive or recursive function, ob- 
taining a function of the same type. 


PROPOSITION 3.14 

Let g(yi , . . . , ) be primitive recursive (or recursive). Let jq , . . .x„ be distinct 
variables and, for l^i^k, let Zj be one of jq, . . . ,x„. Then the function / 
such that f(x \, . . . ,x„) — g(z \, . . . ,z*) is primitive recursive (or recursive). 
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Let Zi = xj. , where 1 ^ y, < n. Then z, — Uj.(x x , ... ,x„). Thus, 

f(X\ j • • * (X| ) • • ■ ) U/j (-*1 1 ■ j-^h)) 

and therefore / is primitive recursive (or recursive), since it arises from 

g, U?,. .. ,Uj k by substitution. 

Examples 

1 . Adding dummy variables. If g(x 1,^:3) is primitive recursive and if 
f{xuX2ix{)=g(x\,x{), then /(xi,X2,X3) is also primitive recursive. In 
Proposition 3 . 14 , let z x = x x and z 2 = X3. The new variable xj is called a 
‘dummy variable’ since its value has no influence on the value of 
f(xi,x 2 ,xi). 

2 . Permuting variables. If g(x x ,x2,m) is primitive recursive and if 
f(x i,x2,X3) = <?(jt3,xi,jc2) 5 then /(xi,x2,x3) is also primitive recursive. In 
Proposition 3 . 14 , let z x = X3 ,z 2 = xj and zi — x 2 . 

3 . Identifying variables. If g(xi,X2,X3) is primitive recursive and if 
f{x i,x 2 ) = g{x\ j x'2, xj ), then f{x x ,x2) is primitive recursive. In Proposi- 
tion 3 . 14 , let n = 2 and z x = x x ,z 2 — x 2 and Z3 = jq. 


COROLLARY 3.15 

(a) The zero function Z„(x x , . . . ,x„) = 0 is primitive recursive. 

(b) The constant function C£(x 1, . . . ,x„) = k, where k is some fixed natural 
number, is primitive recursive. 

(c) The substitution rule (IV) can be extended to the case where each /z, may 
be a function of some but not necessarily all of the variables. Likewise, 
in the recursion rule (V), the function g may not involve all of the 
variables xj, ... ,x„,y, or f(x x , . . . ,x„,y) and h may not involve all of the 
variables xi , . . . ,x„,y, or f(x x , . . . ,x,,,y). 

Proof 

(a) In Proposition 3 . 14 , let g be the zero function Z; then k — 1 . Take z x to be 

X\. 

(b) Use mathematical induction. For k = 0 , this is part (a). Assume C[' 
primitve recursive. Then C|' ( (xi, . . . ,x„) is primitive recursive by the sub- 
stitution Cjj? +1 (xi , . . . ,x„) = N(Cl(x 1, . . . , x n ) ) . 

(c) By Proposition 3 . 14 , any variables among xi, . . . ,x H not present in a 
function can be added as dummy variables. For example, if ft(xi,x3) is 
primitive recursive, then h* (xi , X2 , X3 ) = h (xi , X3 ) = h ( Uf (xi , X2 , X3 ) , U\ (xj , 
JC 2? ^3)) is also primitive recursive, since it is obtained by a substitution. 
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PROPOSITION 3.16 


The following functions are primitive recursive. 

s (a) x+ y 

(b)x-y 


c/ f x — 1 if x > 0 

*M = io if* = 0 

5 is called the predecessor function. 


{ x—y if x^y 

0 if x < y 

. i fx — y if x^y 
TO if x < y 

, . f 0 if x = 0 

(g)sg(^)~| 1 ifjf ^ 0 

. . f 1 if x = 0 

(h)sg(x)-| 0 ifx ^ 0 

(i) x! 

0 min(x,y) — minimum of x and y 
0c) minfxj,...,^) 

(1) max(x,y) = maximum of x and y 

(m)max(jci, . . . ,x„) 

(n) rm(x,y) = remainder upon division of y by x 

(o) qt(x,y) = quotient upon division of y by x 


(e) x 


(0 I*-tI 


* —y 
0 


(g) sg(x) 

(h) sg(x) 


M; 

-{! 

-{i 


Proof 

(a) Recursion rule (V) 

x + 0 = x or /(x, 0) = U{ (x) 

x + (y + 1) = N(x +y) f(x 7 y + 1) = N(f(x,y)) 

(b) x - 0 = 0 or gf(x, 0) = Z(x) 

x ■ {y + 1) = (* • y) + x (x,y + 1) = /(g(x,y), x) 

where f is the addition function 

(c) x° = 1 

ju^+l _ ^.v) . x 

(d) <5(0) = 0 

% + i) -y 

(e) x— 0 = x 
x-(y + l) = £(x-y) 

(f) |x — y| = (x—y) + (y— x) (substitution) 

(g) sg(x) = x— <5(x) (substitution) 
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(h) sg(x) — 1— sg(x) (substitution) 

(i) 0! = 1 

(v + l)! = (;'!) • 0+1) 

(j) minO, y) = x-(x -y) 

(k) Assume min(jti , . . . , x „ ) already shown primitive recursive. 

min(xi , . . . ,x n ,x n+1 ) — min(min(xi, . . . , i ) 

(l) max 0, y) = y + {x-y) 

(m) max(xi , . . . , x n , x /!+ i ) = max(max(xi ,x„), x„ + i ) 

(n) rm(x, 0) = 0 

rm(x,y + 1) = A(rm(x,y)) ■ sg(|* -j\f(rm(x,j/))|) 

(o) qt(x, 0) = 0 

qt(x,y+ 1) = qt(x,y) +sg(|x - N(rm(x,y))\) 

In justification of (n) and (o), note that, if q and r denote the quotient 
qt(x,y) and remainder rm(jt,y) upon division of y by x, then y = qx + r and 
O^rCx. So, y + 1 = qx + (r+ 1). If r+iCx (that is, if \x — 7V(rm 
(x,y))| > 0), then the quotient qt(x,y+ 1) and remainder rm(x,j; + 1) upon 
division of y + 1 by x are q and r + 1 , respectively. If r + 1 = x (that is, if 
\x — W(rm(jE, < y))| = 0), then y + 1 = (# + l)x, and qt(x,y+l) and 
rm(x,y + 1) are q+ 1 and 0, respectively.! 


DEFINITIONS 


^/0i,...,x„,y) = 


y<z 


ffa i > ■ ' ■ > 5 y) 

y ^ z 


n/O n ...,x n ,y) 

y<z 

n/0i,...,x„,y) 

y^z 


ro if z = o 

1 f(X\ ? - ■ ■ ? x n > 0) H h /(*, , 

^ ^ /(*, 7 ■ ■ • 7 X fl 7 y) 
y<z+l 

r i if z - o 


* • 7 X„ 7 Z' 


l /ft 5 ■ ' " ) X„ 5 0) ' » » • f fax 5 , X f1 ,2 1 ) 

" n f(xu * ■ ■ j x ni y) 

y<z-h 1 


if z > 0 


if z > 0 


These bounded sums and products are functions of x \, . . . We can also 

define doubly bounded sums and products in terms of the ones already 
given; for example, 


^ ^ /(*! J • * - J Xn j y) f {x I 7 • * ■ 7 Xfl ) ti H” 1 ) "f ■ ■ * T f{X\ } ■ ■ • 5 Xfi 5 v 1 ) 

u < y < v 

— /Oi,...,X„,y + u+ 1) 

y < 5(v—u) 


! Since one cannot divide by 0, the values of rm(0,y) and qt(0,y) have no in- 
tuitive significance. It can be easily shown by induction that the given definitions 
yield rm(0,y) — y and qt(0,y) = 0. 
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PROPOSITION 3.17 

If f(x u . . . ,x„,y) is primitive recursive (or recursive), then all the bounded 
sums and products defined above are also primitive recursive (or recursive). 


Proof 


Let g(x i , . 
sion: 


. . ,x„,z ) = f( x U' ■ • A»d)- Then we have the following recur- 

y<z 

0(x ly . 0) = 0 

g(X\ j ■ * * 5 1 ) 0&1 > ■ • ■ ) **7i j f fa 1 ? • - ■ ? %n j 


If hfa ij • * • y v ./T^i ? * • * then 

y < z 

h (xj , . . . , x n , z) = g(x\ , . . . , x„ , z + 1 ) (substitution) 


The proofs for bounded products and doubly bounded sums and products 
are left as exercises. 


Example 

Let t(x) be the number of divisors of x, if x > 0, and let t(0) = 1. (Thus, t(x) 
is the number of divisors of x that are less than or equal to x.) Then x is 
primitive recursive, since 

<*) = ^ sg(rm(y,x)) 

y ^ x 

Given expressions for number-theoretic relations, we can apply the 
connectives of the propositional calculus to them to obtain new expressions 
for relations. For example, if Ri(x,y) and Ri(x, u, v) are relations, then 
Ri(x,y) A Ri(x, u, v ) is a new relation that holds for x,y, w, v when and only 
when both f?i(x,y) and Ri(x, a, v) hold. We shall use (Vy) <z f?(jq, . . . , x n , y ) 
to express the relation: for all y, if y is less than 2 , then R(x \ , . . . ,x„,y) holds. 
We shall use (Vy)^^ (3y) <z and (3y) <z in an analogous way; for ex- 
ample, (3y) <z i?(xi, . . . ,x„,y) means that there is some y < z such that 
R(x U ...,x» y y) holds. We shall call (Vy) y<z , (Vp) J , <z (3^) y<z and (3y) >I<z 
bounded quantifiers. In addition, we define a bounded p-operator: 

{ the least y < z for which R(x , , . . . , x,„ y) 
holds if there is such a y 
z otherwise 

The value z is chosen in the second case because it is more convenient in later 
proofs; this choice has no intuitive significance. We also define 
py y ^ z R(x 1 , . . . ,x n ,y) to be py y<z+ \R(x \, . . . ,x„,y). 

A relation R{x \ , . . . ,x„) is said to be primitive recursive (or recursive) if and 
only if its characteristic function C R (x \, . . . ,x„) is primitive recursive (or re- 
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Ji 


cursive). In particular, a set A of natural numbers is primitive recursive (of 
recursive) if and only if its characteristic function C 4 (x) is primitive recursive 
(or recursive). 


Examples 

1 . The relation xi = X 2 is primitive recursive. Its characteristic function H 
sg(jxi — X 2 I), which is primitive recursive, by Proposition 3.16(f,g). 

2. The relational < X 2 is primitive recursive, since its characteristic function 
is sg(x 2 — * 1 ), which is primitive recursive, by Proposition 3.16(e,h). 

3 . The relation xi |x 2 is primitive recursive, since its characteristic function is 
sg(rm(xi , x 2 )) . 

4. The relation Pr(x), (x) is a prime, is primitive recursive, since 
C Fr (x) = sg(|t(x) — 2|). Note that an integer is a prime if and only if it has 
exactly two divisors; recall that t( 0 ) = 1 . 


PROPOSITION 3.18 

Relations obtained from primitive recursive (or recursive) relations by 
means of the propositional connectives and the bounded quantifiers are also 
primitive recursive (or recursive). Also, applications of the bounded ^-op- 
erators py y<z and py y ^ z lead from primitive recursive (or recursive) rela- 
tions to primitive recursive (or recursive) functions. 


Proof 

Assume i?i(xi, . . . ,x„) and R- 2 (x\ , . . . ,x„) are primitive recursive (or re- 
cursive) relations. Then the characteristic functions C R] and C Rl are primi- 
tive recursive (or recursive). But C^ R] (xj, . . . ,x„) = l—C/?, (xi, . . . ,x„); hence 
- 1 R 1 is primitive recursive (or recursive). Also, C R} y Rl (xi, . . . ,x„) 
— C R , (xi , . . . ,x„) • C R? (xi , . . . ,x„); so, R\ V R 2 is primitive recursive (or re- 
cursive). Since all propositional connectives are definable in terms of -1 and 
V, this takes care of them. Now, assume R(x 1 , . . . , x„ , y) is primitive recursive 
(or recursive). If Q{x 1 , . . . , x„, z) is the relation (3y) <z R(xi , . . . , x„,y), then it 
is easy to verify that Cq(xi, . . . ,x„,z) = U y<z C R (xi, . . . ,x„,y), which, by 
Proposition 3.17, is primitive recursive (or recursive). The bounded quan- 
tifier (3y) y<z is equivalent to (3y) y<z+1 , which is obtainable from (3y) y<z by 
substitution. Also, (Vy) y<z is equivalent to -i(3y) y<z -i, and (Vy) < z is 
equivalent to -i(3y) y<z -i. Doubly bounded quantifiers, such as (3y) w< <v , 
can be defined by substitution, using the bounded quantifiers already 
mentioned. Finally, IT, (xi , . . . ,x„, a) has the value 1 for all y such that 
R(x 1 , ... ,x„,i/) is false for all u it has the value 0 as soon as there is some 
u ^ y such that i?(xj , . . . ,x„, u) holds. Hence, ^ <z (II w <yQ?(xi, ■ • • , x n , «)) 
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counts the number of integers from 0 up to but not including the first y < z 
sU ch that R{x\ , . . . , x„ , y) holds and is z if there is no such y; thus, it is equal 
to uy y <zR( x i, . . . ,x„,y) and so the latter function is primitive recursive (or 
recursive) by Proposition 3.17. 


Examples 

JJ Let p(x) be the x t /, prime number in ascending order. Thus, 
«(0) = 2, p(l) = 3, p( 2) = 5, and so on. We shall write p x instead of 
p(x) . Then p x is a primitive recursive function. In fact, 

Po = 2 

Px+i = m ^ (p x y.+ i {p x < y A Pr(y)) 

Notice that the relation u <y f\ Pr(v) is primitive recursive. Hence, by 
Proposition 3.18, the function < y APr(y)) is a primitive re- 

cursive function g(u, v). If we substitute the primitive recursive functions 
z and z! + 1 for u and v, respectively, in g(a, v), we obtain the primitive 
recursive function 


h(z) = py y * z .+i (z < y A Pr(y)) 

and the right-hand side of the second equation above is h(p x ); hence, we 
have an application of the recursion rule (V). The bound (p x )\ + 1 on the 
first prime after p x is obtained from Euclid’s proof of the infinitude of 
primes (see Exercise 3.23). 

2. Every positive integer x has a unique factorization into prime powers: 
x = pQ ] p e l' . . .p a k k . Let us denote by (x) ■ the exponent cij in this factor- 
ization. If x = 1 , (x)j = 1 for all j. If x = 0, we arbitrarily let (x) ■ = 0 for 
all j. Then the function (x) . is primitive recursive, since (x)j = 

m<x(Pj\ x A !*))- 

3. For x > 0, let dh{x) be the number of non-zero exponents in the factor- 
ization of x into powers of primes, or, equivalently, the number of dis- 
tinct primes that divide x. Let £h( 0) = 0. Then £h is primitive recursive. 
To see this, let i?(x,y) be the primitive recursive relation 
Pr(y) Ay\x Ax ^ 0. Then £k(x) <x sg(C R {x,y)). Note that this 
yields the special cases £Ti(0) = £Ti(l) = 0. The expression TT(x)’ should 
be read ‘length of x ’. 

4. If the number x = 2 a ° 3 fll . . . p a k k is used to ‘represent’ or ‘encode’ the 
sequence of positive integers r/o,«i, • * • and y = 2 /j °3 /; ' . . ,p } ’p ‘re- 
presents’ the sequence of positive integers ho, hi, ... , h wl , then the 
number 

‘represents’ the new sequence ao,a \,. . . , tf£,ho,hi, . . . ,b m obtained by 
juxtaposing the two sequences. Note that £ft(x) =k-\- 1, which is the 
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length of the first sequence, tTi (_y) = /w + l, which is the length of tU 
second sequence, and bj = (y) .. Hence, 

x*y=x- JJ ipen(x)+j) (y)j 
j<en tv) 

and, thus, * is a primitive recursive function, called the juxtaposition 
function. It is not difficult to show that x * (y * z) = (x * y) * z as long as 
0 (which will be the only case of interest to us). Therefore, there is no 
harm in omitting parentheses when writing two or more applications of 
*. Also observe that x * 0 = jc * 1 = jc. 

Exercises 

3.16 Assume that R(x i, . . . ,jt H ,y) is a primitive recursive (or recursive) re- 
lation. Prove the following: 

( a ) (3y) u<y<v R (xi , • • • > x „ , y ) , (3y) w <y<v R(x i , • • • , x n , y) and (3y) u<y<v R (x u 
. . . ,x„,y) are primitive (or recursive) relations. 

(b) yy\ i <.)><_ iR (x \ , . . . , x n , y) , y\'i/ < i, R (x I , ... , x /: , \ ) and ny ; u ^.y<vR (^ i ; ■ * • 
are primitive recursive (or recursive) functions. 

(c) If, for all natural numbers jq, . . . , x n , there exists a natural number y such 
that P(jci, . . . , x„ , y ) , then the function f(pc i, . . . , x tl ) = pyR(x j, . . . ,x„,y) is; 
recursive. [Hint: Apply the restricted /^-operator to C R (x \ , . . . ,x„,y).] 

3.17 

(a) Show that the intersection, union and complement of primitive recursive 
(or recursive) sets are also primitive recursive (or recursive). 

(b) Show that every finite set is primitive recursive. 

3.18 Prove that a function f{x\, . . . , x „ ) is recursive if and only if its re- 
presenting relation f(x i, . . . , x „ ) = y is a recursive relation. 

3.19 Let [\fn] denote the greatest integer less than or equal to y'n, and let 
n (n) denote the number of primes less than or equal to n. Show that [i//i] 
and n(w) are primitive recursive. 

3.20 Let e be the base of the natural logarithms. Show that [ne], the greatest 
integer less than or equal to ne, is a primitive recursive function. 

3.21 Let RP(y,z) hold if and only if y and z are relatively prime, that is, y 
and z have no common factor greater than 1. Let (p(n) be the number of 
positive integers less than or equal to n that are relatively prime to n. Prove 
that RP and (p are primitive recursive. 

3.22 Show that, in the definition of the primitive recursive functions, one 
need not assume that Z(x) = 0 is one of the initial functions. 

3.23 Prove that pk+i < {popi • • - pic) + L Conclude that pk+i ^ pk ! + L 

For use in the further study of recursive functions, we prove the following 
theorem on definition by cases. 
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PROPOSITION 3.19 
let 

' c/i(x i, . . . ,x „ ) if Ri (x\,...,x n ) holds 
g 2 (xi , . . . ,x„) if R 2 (x \,...,x„) holds 

f(X\ , ■ • • , X„) i 

« 

, </*(xi ,-•-,*») ^ R*(xi , . . . ,x„) holds 

If the functions gi,...,gk and the relations Ri , . . . , are primitive recursive 
(or recursive), and if, for any xi,...,x,„ exactly one of the relations 
- ,x«), ■ ,Rk(x is true, then / is primitive recursive (or re- 

cursive). 


Proof 

• • • )Xjj) gi(xii - ■ • >x n ) • sg(C/j| (xj , . . . ,x,j)) + . . . 4- 

g&'(x l) • • • )X») * sg(C)^.(xi, . . . , X f j ) ) . 

^Exercises 


3.24 Show that in Proposition 3 . 1 9 it is not necessary to assume that R^ is 
primitive recursive (or recursive). 

3.25 Let 



if x is even 
if x is odd 


Prove that / is primitive recursive. 


3.26 Let 



2 if Goldbach’s conjecture is true 
1 if Goldbach’s conjecture is false 


Is h primitive recursive? 

It is often important to have available a primitive recursive one-one 
correspondence between the set of ordered pairs of natural numbers and the 
set of natural numbers. We shall enumerate the pairs as follows: 

( 0 , 0 ), ( 0 , 1 ), ( 1 , 0 ), ( 1 , 1 ), ( 0 , 2 ), ( 2 , 0 ), ( 1 , 2 ), ( 2 , 1 ), ( 2 , 2 ), ... 

After we have enumerated all the pairs having components less than or 
equal to k, we then add a new group of all the new pairs having components 
less than or equal to k+\ in the following order: (0 , k + 1), {k + l, 0), 
(1 , k 4 1 ) , (k -(-1,1),..., ( k , k 4" 1 ) , ( k -f- 1 , /r) , ( k -j-l,/c-f-l). If x <c y, then 
(x, y) occurs before (y, x) and both are in the (y -f- l)th group. (Note that we 
start from 1 in counting groups.) The first y groups contain y 2 pairs, and 
(x,y) is the (2x 4 1 )th pair in the (y+l)th group. Hence, (x,y) is the 
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(y 2 E 2x + l)th pair in the ordering, and (y,x) is the (y 2 + 2x + 2)th pair. Q n 
the other hand, if x = y, (x,y) is the ((x + l) 2 )th pair. This justifies the fop 
lowing definition, in which <r 2 (x,y) denotes the place of the pair (x,y) i n th e 
above enumeration, with (0, 0) considered to be in the 0th place: 

<r 2 (x,y) - sg (x-y) ■ (x 2 + 2y-f 1) \ sg(x-y) • (y 2 E Zv) 

Clearly, er 2 is primitive recursive. 

Let us define inverse functions a\ and c\ such that 
aii^fay)) =y and c 2 (g\(z) , a\(z)) = z. Thus, a 2 (z) and 

o\{z) are the first and second components of the zth ordered pair in the given 
enumeration. Note first that a 2 ( 0) = 0, = 0, 

( o\{n) if < o\ (« ) 

o\{n) - < o\(n) E l if of(w) > c\(n) 

[ 0 if o 2 (h) = o\{n) 

and 



if fj 2 (») ^ o\(n) 
if c[{n) = al(n) 


Hence, 

<t 2 (w+ 1) = o\{n) ■ (sg(<r|(«) — («) )) E (^2(») + J ) • ( w )~ 

o\(n E 1) = o\(n) • (sg(|<r|(«) - <r?(n)|)) E (of(») El)* (sg(l<^(«) - o?(«)|)) 

= 'K a i(”)> <%(”)) 

where (p and \J/ are primitive recursive functions. Thus, a\ and are defined 
recursively at the same time. We can show that a\ and a\ are primitive 

recursive in the following devious way. Let h(u) = 2 c i^ u >3 (T ^ tl \ Now, h is 

primitive recursive, since /?(0) = 2 fr i(°)3°2^ = 2° • 3° = 1, and h(n E 1) 
_ 2^(«+i) 3^( w + 1 ) = - 2*p(C*(«)) 0 >(*("))i) 3 ^((a(»))o.(*(»))j) > 

Remembering that the function (x). is primitive recursive (see Example 2 on 
page 181), we conclude by recursion rule (V) that h is primitive recursive. 
But <t 2 (x) = (/z(x)) 0 and o|(x) = (/z(x))j. By substitution, c\ and are 
primitive recursive. 

One-one primitive recursive correspondences between all n-tuples of 
natural numbers and all natural numbers can be defined step — by - step, 
using induction on n. For n = 2, it has already been done. Assume that, for 
n =k, we have primitive recursive functions <r*(xi , . . . ,x^), <t^(x), . . . , ff£(x) 
such that o|'(t/(xi, . . . ,x^)) = x,* for and 0 ^( 0 ^ (x), . . . , <r£(x) — x. 

Now, for n=k + 1, define <r* +1 (xi, . . . ,xt,x^ + i) = <r{c k (x 1 , . . . ,xt),xjt + i), 
<t| +1 (x) = <t^(<t 2 (x)) for l^i^k and o£+j(x) = {^(x). Then f/ +l , < 7 j +l , 
. . . , are all primitive recursive, and we leave it as an exercise to verily 
that <rj 4 1 (xj , . . . , xjt + i )) = Xi for 1 < i < k E I , and (x), . . . , 

<4X\ ( x )) = x. 
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It will be essential in later work to define functions by a recursion in 
w hich the value of f{x\, + 1) depends not only upon f(x i, . . . ,x„,y) 

but also upon several or all values of f(x \, . . . , x n , u) with u^y. This type of 
recursion is called a course- of -values recursion. Let ff(xi, . . . ,x„,y) = 

Note that f can be obtained from fff as follows: 

fix 1 , . • • ,x n ,y ) = (/#(*!, ...,x„,y+ l)) r 


PROPOSITION 3.20 (COURSE-OF-VALUES RECURSION) 

-O 

;jf h(x \ , . . . ,x„,y,z) is primitive recursive (or recursive) and f(x i, . . . , x„ , y ) 
*s= h(x i, . . . ,x tt ,y,fff{x i, . . . ,x„, j)), then / is primitive recursive (or re- 
cursive). 

proof 


f f^ixi i • • • i x„, 0) — 1 

f #(*i , . - - ,x„,y + 1) = /#(*, , . . . ,x„,y) - 

= /#(xi,. . . , x„ , v) 


Thus, by the recursion rule, fjf is primitive recursive (or recursive), and 
fix i j • - ■ if ft~{x l j • • • i -j- 1))^,. 


Example 

The Fibonacci sequence is defined as follows: /( 0) = 1,/(1) = 1, and 
f(k + 2) — f{k) +/(/c + 1 ) for A^O. Then f is primitive recursive, since 

/(«) = Sg(«) + Sg(l« - 1 1) + ((/#(«)Ul + C/#(«))„-2) ■ Sg(H-l) 

The function 

A(y,z) = sg(y) + sg(|y - 1|) + ((z) r _i + (4,.^) ' s ^(} ; - 1 ) 
is primitive recursive, and f[ri) = h{n,fff(ti)). 


Exercise 

3.27 Let g(0) = 2, g(l) = 4, and g(k + 2) = 3 g(k + 1) — {2g(k) + 1). Show 
that g is primitive recursive. 


COROLLARY 3.21 (COURSE-OF-VALUES RECURSION 
FOR RELATIONS) 

If H{x\, . . . ,x nj y,z) is a primitive recursive (or recursive) relation and 
R(xi,...,x ni y) holds if and only if H{x u - • ■ ,x„,y,{C R )#(x u - • • ,x„,y)). 
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where Cr is the characteristic function of R, then R is primitive recursive (or 
recursive). 


Proof 

C R (x 1 , . . . ,x„,y ) = C H (x 1 , . . . ,x„,y, (C R )#(x u . . . ,x„,y )). Since C H is primi- 
live recursive (or recursive), Proposition 3.20 implies that Cr is primitive 
recursive (or recursive) and, therefore, so is R. 

Proposition 3.20 and Corollary 3.21 will be drawn upon heavily in what 
follows. They are applicable whenever the value of a function or relation for 
y is defined in terms of values for arguments less than y (by means of a 
primitive recursive or recursive function or relation). Notice in this con- 
nection that R(x\ , . . . ,x„, ii ) is equivalent to Cr(x i, . . . , x„ , u) = 0, which, in 
turn, for u < y, is equivalent to ((Cr)#(x i, . . . , x„ , y) ) (/ = 0. 


Exercises 

3.28 Prove that the set of recursive functions is denumerable. 

3.29 If /o,/i,/ 2 , . . . is an enumeration of all primitive recursive functions 
(or all recursive functions) of one variable , prove that the function f x (y) is 
not primitive recursive (or recursive). 


LEMMA 3.22 (GODEL’S ^-FUNCTION) 

Let /?(jq,X 2 ,JC 3 ) — rm(l + (*3 + 1) -* 2 ,jci). Then (3 is primitive recursive, by 
Proposition 3.16(n). Also , ft is strongly representable in S by the following 
wf Bt(xi,x 2 ,x 3 ,y): 

(3W)(.yi = (1 + (X 3 + 1) • x 2 ) ■ w +y A y < 1 + (X 3 + 1) • x 2 ) 


Proof 

By Proposition 3.11 h s ( 3 iy)Bt(xi,x 2 ,X 3 ,y). Assume P{k\,k 2 ,k?) — m. Then 
k\ = (1 + (A 3 + 1) • A: 2 ) • k + m for some k, and m < 1 + (#3 + 1) • hi. So, 
|- s k\ = (1 + (£3 + 1) • ki) • k + m , by Proposition 3.6(a). Moreover, 
l~s m < l + (£3 + 1) • ki by the expressibility of < and Proposition 3.6(a). 
Hence, hs k\ — ( l + (£3 + l) ■ ki) • k + m Am < 1 + (£3 + 1) • ki from which 
by rule £4, hs Bt(&i, k 2 ,ki, m). Thus, Bt strongly represents fi in S. 


LEMMA 3.23 

For any sequence of natural numbers ko,k \, . . . ,k„, there exist natural 
numbers b and c such that — ki for 0 
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proof 

Let j — max(w,£o>£i) • • • ,k„) and let c = j\. Consider the numbers 
l(j = 1 + (/' + I )c for 0 ^z no two of them have a factor in common other 
than 1 . In fact, if p were a prime dividing both 1 + (z + l)c and 1 + (m + 1 )c 
with 0<z < m < n, then p would divide their difference (m — i)c. Now, p 
does not divide c, since, in that case p would divide both (z + l)c and 
\ -|- (/ + l)c, and so would divide 1, which is impossible. Hence, p also does 
not divide (m — i); for m — i^n ^ / and so, m — i divides j\ = c. If p divided 
iH — /, then p would divide c. Therefore, p does not divide (m — z)c, which 
yields a contradiction. Thus, the numbers zz,-,0<z* < n, are relatively prime 
in pairs. Also, for O^z < n, kj^j ^j\ — c < 1 +(z + l)c = that is, k\ < u,. 
Now, by the Chinese remainder theorem (see Exercise 3.30), there is a 
number b < Uqii\ . . . u„ such that rm(z/,-, b) = k t for O^i^n. But 
/}(£, c, i ) = rm(l + ( i + l)c, b ) = rm(w,-, b) = kj. 

Lemmas 3.22 and 3.23 enable us to express within S assertions about 
finite sequences of natural numbers, and this ability is crucial in part of the 
proof of the following fundamental theorem. 


PROPOSITION 3.24 

Every recursive function is representable in S. 

Proof 

The initial functions Z, N and Uj' are representable in S, by Examples 1- 3 
on page 172. The substitution rule (IV) does not lead out of the class of 
representable functions, by Example 4 on page 172. 

For the recursion rule (V), assume that g(x i, . . . , x n ) and h{x j, . . . , x„ , y, z) 
are representable in S by wfs 38{x \, . . . ,x n+ i) and %>{x \ , . . . ,x, ;+ 3 ), respec- 
tively, and let 

(7) ./'(x 1 5 * - • i 0 ) £/(xi , - - - >Xjf) 

f(xi 3 . . . 3 Xn^y T 1) h{x i , - • ■ iX n ^y^fixy •> • • • iX^y)) 

Now, f(x i, . . . ,x„,y) = z if and only if there is a finite sequence of numbers 
bo,. . . ,b y such that bo = g(x \ , . . . ,x„), b w+i = h(x \ , . . . ,x„, w, b w ) for 
w + 1 and by = z. But, by Lemma 3.23, reference to finite sequences can 
be paraphrased in terms of the function ft and, by Lemma 3.22, /? is re- 
presentable in S by the wf Bt(jci,x 2 ,X 3 ,y;). 

We shall show that f(x\, . . . ,x„,x } , + i) is representable in S by the fol- 
lowing wf $>(x \, . . . ,x w+2 )' 

(3u)(3v)[((3w)(Bt(ii, v, 0, w) A $(x \ , . . . ,x„, w))) A Bt(zz, v,x n+x ,x n+2 ) 

A (Vvv)(vv < x„ + i =>• (3y)(3z)(Bt(u, v, w, y) A Bt(zz, v, w,z) A ^(x\ , 


» « m 


>X|i, w,y,z)))] 
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(i) First, assume that f(x \, . . . ,x m p) = m. We wish to show that 
hs Si{k\ , . . . ,k„,pi m). If p — 0, then m = g(k \ , . . . ,k„). Consider the se- 
quence consisting of m alone. By Lemma 3.23, there exist b and c such that 
P(b, c, 0) = m. Hence, by Lemma 3.22, 

(X) hs Bt(Ac,0,w) 

Also, since m ~ g{k\ ,k„), we have hs £%(k\ , . . . , k n , m). Hence, by rule 
E4, 


(££) hs (3>v)(Bt(fe, c, 0, w) a 3S{k\ , . . . ,k n ,w)) 

In addition, since hs w<£0, a tautology and Gen yield 

(XXX) (Vw)(w <0^ (3y)(3z)(Bt(A,c,vv,y)A(Bt(6, c,w',z)A <£(k \, . . . A,w,y,z))) 

Applying rule E4 to the conjunction of (3E),(3t$) and (XXX), we obtain 
hs *3)(k \ , . . . ,k n ,0,m). Now, for p > 0,/ (k\,. . . , k„,p ) is calculated from the 
equations (I) in p3-\ steps. Let r = f(k\,...,k n ,i). For the sequence of 
numbers /*o ,...,r p , there are, by Lemma 3.23, numbers b and c such 
that p(b, c, i) = fj for 0 </</>. Hence, by Lemma 3.22, h s Bt (b,c, 7,r f ). 
In particular, fi(b,c,0) = rp = / (k \ , . . . ,k n , 0) = g(k\, . . . , k„). Therefore, 
hs Bt (b, c,0_,r 0 )A@(k u ■ ■ • ,k„,r 0 ), and, by rule E4, (i) hs (3w)(Bt(Z?,c,0,w) 
hM{k \ , . . . , k ni w)). Since_ r p — f(k \, . . . ,k„,p) = m, we have fi(b, c,p) 
= m. Hence, (ii) hs Bt(b,c,p, m). For 0 < / < p — 1, fi(b, c, /') = r f —f 
(Mi • ‘ ‘ ,k n ,i) and P(b,c, I + 1) = A+i = f(k \ , - ■ ■ ,k„,i -f 1) _= h(k\, ■ - • 
f(k x ,--- ,k„,i)) = h(k\, • , k n , r,-). Therefore, h s Bt(h,c, ijj) A Bt 

(b,c,/,r i±1 ) A<£(ku ■ • ' ,k, n i,r h r i+ i). By Rule E4, h s (3y)(3z)(Bt (b,c~i,y) 
f\Bt(b,c,i ,z) f\<&(k\,' ,k li2 i,y,z)) So, by Proposition 3.8(b'), (iii) hs 
(Vw)(w < p => (3y)(3z)(Bt(I?,c,w,y) ABt(fc,c, W,z)A ^(k u . . . ,k„, w,y,z))). 
Then, applying rule E4 twice to the conjunction of (i), (ii) 1 and (iii), we obtain 
hs ^(k\, - ■ -k n ,p,m). Thus, we have verified clause 1 of the definition of 
representability (see page 171). 

(ii) We must show that hs (3\x n +i)&i(k\, • ■ > ,k„,p,x n+ 2 ). The proof is 
by induction on p in the metalanguage. Notice that, by what we have 
proved above, it suffices to prove only uniqueness. The case of p = 0 is left 
as an easy exercise. Assume hs {3\x n ^2)^(k \, . . . ,k„,p,x, l+ 2 )- Let a = g 
(ki,..,,k„),P =f(k u .,., k„,p), and y = /(*i,..., K,p3-l) = h (h,..., 
k„,p,P). Then 

(1) \- s ^(ki,.. .,k„,p,p,y) 

(2) h s 8i(k \, . . . ,£«,«) 

(3) h s 3(k \, .. .,k„,p,p) 

(4) h s ^(ki,. ..,k„,p+ l,y) 

(5) h s (3 x x„+2)S#(k \ , . . . ,k„,p,x n+2 ) 


Assume 
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(6) I,x tt+2 ) 

* 

We must prove x „ + 2 = y. From (6), by rule C, 

(a) (3w) (Bt(Z?, c, 0, w) A 3i(k \ , . . . , k „ , w)) 

(b) Bt(£,c,/>+ l,x„ +2 ) 

(c) (Vw)(vv<p4 1 

=► (3y)(3z)(Bt(Z?,c,w,y) A Bt (b,c,w',z) A ^(k y , . . . ,k„,w,y,z))) 

From (c), 

(d)(Vw)(w < p =» (3y)(3z)(Bt(2>,c, w,y) A Bt (b,c,w',z) A *?(*!,. . . w,j/,z))) 

From (c) by rule A4 and rule C, 

(e) Bt(Z?,c } p,d) A Bt(fc,c,/>4- l,e) f\<£[k\ y . . . 

From (a), (d), and (e), 

(f) g>(ki,'..,k„,p,d) 

From (f), (5) and (3), 

(g) d = P 

From (e) and (g), 

(h) <#(k u ... ,k n ,Pyp,e) 

Since /? represents h, we obtain from (1) and (h), 

(i) y = e 

From (e) and (i), 

(j) Bt(6, c,p+ 1, y) 

From (b), (j), and Lemma 3.22, 

(k) x n +2 = Y 

This completes the induction. 

The /i-operator (VI). Let us assume, that, for any jq, . . . ,jt„, there is some 
y such that g{xi , . . . = 0, and let us assume g is representable in S by a 

wf . . . , x„ + 2 ) • Let f{x i, . . . ,x„) = py(g(x y , . . . , x„ , y) — 0). Then we shall 
show that f is representable in S by the wf ^(x y , . . . ,x„ + i): 


£(x\, ■ ■ • ,x„ + i,0) A (Vy)(y < x, I+ i => . . . ,*„,y,0)) 


Assume f(k y , . . . , k„) = m . Then g{k \ , . . . , k„, m) = 0 and, for k < /«, 
g{k\ , . . . , k„, k) 7^ 0. So, h s $(k\ ,k n ,m, 0) and, for k < m , h s 
(k y , ■ « • , k n , k, 0). By Proposition 3.8(b'), b s (Vy) (y <m =*• -*${ky , . . . , 
^„,3^,0)). Hence, hs ^(ki,. . . ,k n ,m). We must also show: h s (3ix w+ i) 


^(kiy...,k ni x„ +i ). It suffices to prove the uniqueness. Assume 
&(ku -A, 0) A (Vy)(y < it =*► (£i , . . . , k„ , y, 0)) . By Proposition 


* 4 « 
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3.7 (o'), l~s m < u V m = it V w < m. Since t- s S{k \ , . . . , k„, iw, 0), we cannot; 
have m<u. Since \- s (VyXy <m^ ,k„,y,0)), we cannot have. 

u<m. Hence, u = m. This shows the uniqueness. 

Thus, we have proved that all recursive functions are representable in S, 

COROLLARY 3.25 

Every recursive relation is expressible in S. 

Proof 

Let R(x \ , . . . ,x„) be a recursive relation. Then its chracteristic function C R is 
recursive. By Proposition 3.24, Cr is representable in S and, therefore, by 
Proposition 3.13, R is expressible in S. 

Exercises 

3.30 a 

(a) Show that, if a and b are relatively prime natural numbers, then there is 
a natural number c such that ac = l(mod b). (Two numbers a and b are 
said to be relatively prime if their greatest common divisor is 1. In 
general, x = y (mod z) means that x and y leave the same remainder upon 
division by z or, equivalently, that x — y is divisible by z. This exercise 
amounts to showing that there exist integers a and v such that 
1 = au + bv.) 

(b) Prove the Chinese remainder theorem: if jq , . . . , x k are relatively prime in 

pairs and yi , . . . , y k are any natural numbers, there is a natural number z 
such that z = y\ (modxi), . . . , z = ^(mod**). Moreover, any two such zs 
differ by a multiple of xi . . . [Hint: Let x = xi . . . x^ and let 
x = vvjXj = vv2*2 = . • ■ = Wfrjty. Then, for 1 wj is relatively prime 

to xj and so, by (a), there is some zj such that wjZj = 1 (mod xj). Now let 
z = w\Z\y\ + \v 2 Z 2 y 2 + • • • + w k z k )y. Then z = vvjZjyj = ^/(modjcy). In 
addition, the difference between any two such solutions is divisible by 
each of x \ , . . . , x k and hence by xj . . . x>.] 

3.31 Call a relation R(x 1 , . . . ,x„) arithmetical if it is the interpretation of 
some wf &(xi,. . . ,x„) in the language a of arithmetic with respect to the 
standard model. Show that every recursive relation is arithmetical. [Hint: 
Use Corollary 3.25.] 


3.4 ARTTHMETIZATION. GODEL NUMBERS 

For an arbitrary first-order theory K, we correlate with each symbol u of K 
an odd positive integer called the Godel number of w, in the following 
manner: 
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#(() = 3, #()) = 5, g(, ) = 7, </(->) = 9, = 11,</(V) = 13, 

g(x k ) = 13 + 8* for ft ^1 
g\a k ) = 7 + 8£ for 1 

C/T) = 1 + 8(2"3 fc ) for ft, n ^ 1 
= 3 + 8(2”3*) for ft, n ^ 1 


Clearly, every Godel number of a symbol is an odd positive integer. 
Ivloreover, when divided by 8, g(ii) leaves a remainder of 5 when u is a 
variable, a remainder of 7 when u is an individual constant, a remainder of 1 
when it is a function letter, and a remainder of 3 when u is a predicate letter. 
Thus, different symbols have different Godel numbers. 


Examples 

g(x 2 ) = 29, 0 ( 04 ) = 39, gift) = 97, g(A\) = 147 

Given an expression uqU\ . . . u r , where each itj is a symbol of K, we define 
its Godel number g(uoiti . . . u r ) by the equation 

g(uoUi . . . u r ) = . . .pf llr) 

where pj denotes the jth prime number and we assume that po = 2. For 
example. 


g{A\{x i, x 2 )) = 2 ^) 3 »(O 53 bi) 70 (.)iis(.T 2 ) 13 aO) 

= 2 " 3 3 5 21 7 7 11 29 13 5 

Observe that different expressions have different Godel numbers, by virtue 
of the uniqueness of the factorization of integers into primes. In addition, 
expressions have different Godel numbers from symbols, since the former 
have even Godel numbers and the latter odd Godel numbers. Notice also 
that a single symbol, considered as an expression, has a different Godel 
number from its Godel number as a symbol. For example, the symbol xi has 
Godel number 21 , whereas the expression that consists of only the symbol xi 
has Godel number 2 21 . 

If eo) e \ ) i e r is any finite sequence of expressions of K, we can assign a 
Godel number to this sequence by setting 

g(eo, ei, e,.) = 2 s(eo) 3 s(ei) . . .pf £r) 

Different sequences of expressions have different Godel numbers. Since a 
Godel number of a sequence of expressions is even and the exponent of 2 in 
its prime power factorization is also even, it differs from Godel numbers of 
symbols and expressions. Remember that a proof in K is a certain kind of 
finite sequence of expressions and, therefore, has a Godel number. 

Thus, g is a one-one function from the set of symbols of K, expressions 
of K, and finite sequences of expressions of K, into the set of positive 
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integers. The range of g is not the whole set of positive integers. For ex- 
ample, 10 is not a Godel number. 


Exercises 

3.32 Determine the objects that have the following Godel numbers. 

(a) 1944 (b) 49 (c) 15 (d) 13 824 (e) 2 51 3 n 5 9 

3.33 Show that, if n is odd. An is not a Godel number. 

3.34 Find the Godel numbers of the following expressions. 

(a)/i(oi) (b) ((Vx 3 )Mi(«i, x 3 ))) 

This method of associating numbers with symbols, expressions and se- 
quences of expressions was originally devised by Godel (1931) in order to 
arithmetize metamathematics, ^ that is, to replace assertions about a formal 
system by equivalent number-theoretic statements and then to express these 
statements within the formal system itself. This idea turned out to be the key 
to many significant problems in mathematical logic. 

The assignment of Godel numbers given here is in no way unique. Other 
methods are found in Kleene (1952, chap. X) and in Smullyan (1961, chap. 1, 
§ 6 ). 

DEFINITION 

A theory K is said to have a primitive recursive vocabulary (or a recursive 
vocabulary ) if the following properties are primitive recursive (or recursive); 

i' 

(a) IC(x) : x is the Godel number of an individual constant of K; 

(b) FL(x) : x is the Godel number of a function letter of K; 

(c) PL(x) : x is the Godel number of a predicate letter of K. 


REMARK 

Any theory K that has only a finite number of individual constants, function 
letters, and predicate letters has a primitive recursive vocabulary. For ex- 
ample, if the individual constants of K are aj l , aj 2 , . . . , a Jn , then IC(x) if and 
only if x = 7 + 8y, V x ~ 7 + 8y 2 V . . . V x = 1 + 8y n . In particular, any theory 

tAn arithmetization of a theory K is a one-one function g from the set of 
symbols of K, expressions of K and finite sequences of expressions of K into the set 
of positive integers. The following conditions are to be satisfied by the function g: (1) 
g is effectively computable; (2) there is an effective procedure that determines whether 
any given positive integer m is in the range of g and, if m is in the range of g, the 
procedure finds the object x such that g(x) — m. 
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K in the language ^ a of arithmetic has a primitive recursive vocabulary. So, 
*c has a primitive recursive vocabulary. 
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PROPOSITION 3.26 

get K be a theory with a primitive recursive (or recursive) vocabulary. Then 
l|he following relations and functions (1-16) are primitive recursive (or re- 
sell rsive). In each case, we give first the notation and intuitive definition for 
Ithe relation or function, and then an equivalent formula from which its 
jpfimitive recursiveness (or recursiveness) can be deduced. 

(1) EVbl (x ) : x is the Godel number of an expression consisting of a vari- 
able, (3z) z<¥ (1 Ax = 2 13+Sz ). By Proposition 3.18, this is primitive 
recursive. 

EIC(x) : x is the Godel number of an expression consisting of an in- 
dividual constant, (3y) ><v (IC(y) Ax — 2 y ) (Proposition 3.18). 

EFL(x) : x is the Godel number of an expression consisting of a func- 
tion letter, (3y) j;<x (FL(y) Ax = 2 y ) (Proposition 3.18). 

EPL (x) : x is the Godel number of an expression consisting of a pre- 
dicate letter, (3y) y< v (PL(y) Ax = 2 y ) (Proposition 3.18). 

(2) Arg x (x) = (qt(8, x— 1)) 0 : If x is the Godel number of a function letter 
JJ, then Arg x (x) = n. Arg x (x) is primitive recursive. 

Argp(x) = (qt(8, x-3)) 0 : If x is the Godel number of a predicate letter 
A'j, then Arg P (x) — n. Arg P (x) is primitive recursive. 

(3) Gd(x) : x is the Godel number of an expression of K, EVbl(x)V 
EIC(x) V EFL(x) V EPL(x) V x = 2? Vx = 2 3 4 5 6 Vx=2 7 * Vx = 2 9 V x = 
2 11 V x = 2 13 V(3w) M<v (3u) l)<x (x = u* v A Gd(w) A Gd(r)). Use Corollary 
3.21. Here, * is the juxtaposition function defined in Example 4 on 
page 181. 

(4) MP(x, y, z): The expression with Godel number z is a direct con- 
sequence of the expressions with Godel numbers x and y by modus 
ponens, y = 2 3 * x * 2 11 * z * 2 5 A Gd(x) A Gd(z). 

(5) Gen(x, y): The expression with Godel number y comes from the ex- 
pression with Godel number x by the generalization rule; 

(3r) u<J ,(EVbl(r) A y = 2 3 * 2 3 * 2 13 * r * 2 5 * x * 2 5 A Gd(x)) 

(6) Trm(x) : x is the Godel number of a term of K. This holds when and 

only when either x is the Godel number of an expression consisting of a 

variable or an individual constant, or there is a function letter fg and 

terms t\, . . . , t„ such that x is the Godel number of • - . , t„). The 

latter holds if and only if there is a sequence of n + 1 expressions 



FORMAL NUMBER THEORY 


194 


fk( fk(* i» fe, • • • /* (/i, • ■ • , **-1, • • • , f„-i, t„) 

the last of which, . . . , /„), has Godel number x. This sequence can 
be represented by its Godel number y. Clearly, y < 2 V 3 V . . . rf 
= (2 • 3 • . . . • p«)' v < (/>„!)* < ( a!) a - Note that <?£(y) = n + 1 and also 
that n = Arg T ((x) 0 ), since (x ) 0 is the Godel number of f k . Hence 
Trm(x) is equivalent to the following relation: 

EVbl(x) V EIC(x) V (3^ <(pt , r [x = (v)//(v)-i A 

M(y) — Arg x ((x) 0 ) + 1 A FL(((v) 0 ) 0 ) A ((y) 0 )i = 3 A 

M((y) o) = 2 A ( v “)«<^iKr)- 2 ( 3 l, )i;<r(C) , ) l H-i = &)u * v * 2 7 A Trm(t;))A 

(^WbW)-! = (yh/s (y )-2 * v * 2 5 A Trm(w))] 

Thus, Trm(x) is primitive recursive (or recursive) by Corollary 3.21, 
since the formula above involves Trm(r) for only v < x. In fact, if we 
replace both occurrences of Trm(a) in the formula by (z) v = 0, then the 
new formula defines a primitive recursive (or recursive) relation H(x, z), 
and T rm (x) H(x, ( Cxmi ) ^ (*) ) . Therefore, Corollary 3.21 is applic- 
able. 

(7) Atfml(x) : x is the Godel number of an atomic wf of K. This holds if 
and only if there are terms t \ , . . . , /„ and a predicate letter A n k such that x 
is the Godel number of A k (t\, . . . , /„). The latter holds if and only if 
there is a sequence of n + 1 expressions 

A k ( A k (ti , A k (t \ , ?2i ■ • ■ ? • • • > hi— 1 7 (h , - • • , tn— i , hi) 

the last of which, A" k (t \ , . . . , t n ), has Godel number x. This sequence of 
expressions can be represented by its Godel number y. Clearly, 
y < {p x \) x (as in (6) above) and n — Arg P ((x) 0 ). Thus, Atfml(x) is 
equivalent to the following: 

= (y)™tv)~ 1 A ^ My) = Argp((x) 0 ) + 1 A 
PL(((3 ; )o)o) A ((y)o)i — 3 A A$((y) 0 ) = 2 A 
(y il )u</^ 2 (^ v )v<x((y)u+i = iy)u *v*2 7 A Trm(r)) A 
( 3 0i«x((yW)-i = (y)n w -2 * v * 2 5 A Trm(r))] 

Hence, by Proposition 3.18, Atfml(x) is primitive recursive (or recursive) 

( 8 ) Fml(y): y is the Godel number of a formula of K: 

Atfml(y) V (3z) z<J ,[(Fml(z) Ay = 2 3 * 2 9 *z * 2 5 ) V 

(Fml((z) 0 ) A Fml((z) 1 ) Ay = 2 3 * (z ) 0 * 2 U * (z)j * 2 5 ) V 
(Fml((z) 0 ) A EVbl((z)j) A y — 2 3 * 2 3 * 2 13 * (z) t * 2 5 * (z ) 0 * 2 5 )] 

It is easy to verify that Corollary 3.21 is applicable. 

(9) Subst(x, y, u, v) : x is the Godel number of the result of substituting in 
the expression with Godel number y the term with Godel number u for 
all free occurrences of the variable with Godel number v: 
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Gd(y) A Trm(t/) A EVbl(2 l; ) A [(y = T Ax = u) V 
(3>v) w<J ,(y = 2"’ A y i- 2 V A x = y) V 
(3z) z<; ,(9vv) M , <J ,(Fml(>v) a y = 2 3 * 2 13 *2 V *2 5 *w*z A 
(3a) a<x .(x = 2 3 * 2 13 * 2 V * 2 5 * w * a A Subst(a, z, u, i;))) V 
( (-> ( 3z) z<> , ( 3w) w<y (Fml (w) A y = 2 3 * 2 13 * 2 11 * 2 5 * w * z)) A 

( 3a )«<x( 3 ^)^<x( 3z ) z <y( 1 <z/\y = 2°' ) « * z Ax = ct * ft A 
Subst(a, 2^°, u, i;) A Subst(/?, z, u, t>)))] 

Corollary 3.21 is applicable. The reader should verify that this formula 
actually captures the intuitive content of Subst(x, y, u, v). 

(10) Sub(y, w, v): the Godel number of the result of substituting the term 
with Godel number u for all free occurrences in the expression with 
Godel number y of the variable with Godel number v: 

Sub(v, U, v ) = fJJC x<(pio ,y»’Subst(li, y, U, V ) 

Therefore, Sub is primitive recursive (or recursive) by Proposition 3.18. 
(When the conditions on w, v and y are not met, Sub(y, w, v) is defined, 
but its value is of no interest.) 

(11) Fr(y, v) : y is the Godel number of a wf or term of K that contains free 
occurrences of the variable with Godel number v: 

(Fml(y) VTrm(y)) AEVbl(2 t ) A ->Subst(y, y, 2 13+8l! , v) 

(That is, substitution in the wf or term with Godel number y of a certain 
variable different from the variable with Godel number v for all free 
occurrences of the variable with Godel number v yields a different 
expression.) 

(12) Ff(w, v, \v): u is the Godel number of a term that is free for the vari- 
able with Godel number v in the wf with Godel number w: 

Trmfi/) A EVbl(2 1) ) A Fml(>v) A [Atfml(vv) 

A (3} ; )y<J w = 2 3 * 2 9 * y * 2 5 A Ff(t/, v, y)) 
v = 2 3 *y * 2 U * z * 2 5 

A Ff(i<, v, y) A Ff(w, v, z)) V 
( 3 T)y< w (3z) z<H ,(w = 2 3 * 2 3 * 2 13 * 2 Z * 2 5 *y * 2 5 

A FVbl(2 z ) A (z ^ v => Ff(w, v, y) 

A (Fr(ti, z) => -Fr(y, »))))] 

Use Corollary 3.21 again. 

(13) (a) Axi (a) : x is the Godel number of an instance of axiom schema 

(Al): 

(3 w ).«v(3r) l ,< T (Fml(w) A Fml(i>) 

Ax = 2 3 * u * 2 11 * 2 3 * v * 2 11 * u* 2 5 * 2 5 ) 
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(b) AX 2 (x) : x is the Godel number of an instance of axiom schema 
(A2): 

A Fml(r) A Fml(vv) 

Ax = 2 3 * 2 3 * u * 2 U * 2 3 * v * 2 n * w* 2 5 * 2 s *2 n * 2 3 * 2 3 * u 

* 2 11 * v * 2 5 * 2 11 *2 3 * u * 2 11 * w * 2 5 * 2 5 * 2 5 ) 

(c) AX 3 (x) : x is the Godel number of an instance of axiom schema 
(A3): 

(^^(a^CFmKttjAFml^) 

Ax - 2 3 * 2 3 * 2 3 * 2 9 * v * 2 s * 2 U * 2 3 * 2 9 * u* 2 s * 2 s * 2 U 
* 2? * 2 3 * 2 3 * 2^ * v * 2 5 * 2* * * u * 2~* * 2* * * v * 2 3 % 2 3 ) 

(d) AX 4 (x) : x is the Godel number of an instance of axiom schema 
(A4): 

(3*0 hoc oor ( ^y)y<x (Fml (y ) ATrm(w) A EVbl(2 p ) A Ff( W , «, y) 

A x = 2 3 * 2 3 * 2 3 * 2 13 * 2 V * 2 s * y * 2 11 * Sub(v, u, v ) * 2 s ) 

(e) Axs (x) : x is the Godel number of an instance of axiom schema 
(A5): 

(3 M )«<r( 3w )»<x( 3l, ') w< *(Fml(M) A Fml(w) A EVbl(2 1 ') A -.Fr(«, v) 

Ax = 2 3 * 2? * 2 3 * 2 13 * 2 e * 2 5 * 2 3 * u * 2 11 * >v * 2 5 * 2 s 

* 2 11 * 2 3 * u * 2 11 * 2 3 * 2 3 * 2 13 * 2 V * 2 5 * w * 2 5 * 2 s * 2 5 ) 

(f) LAX(y): y is the Godel number of a logical axiom of K 

Ax! (y) V Ax 2 (y) V Ax 3 (v) V Ax 4 (y ) V Ax 5 (v) 


(14) The following negation function is primitive recursive. Neg(x): the 
Godel number of (_•£$) if x is the Godel number of 

Neg(x) = 2 3 * 2 9 * x * 2 5 

(15) The following conditional function is primitive recursive. Cond(x, y): 
the Godel number of (Pd =>- < d) if x is the Godel number of Pd and y is 
the Godel number of ( £\ 


Cond (x, y) = 2 3 * x * 2 1 1 * y * 2 5 


(16) Clos(w): the Godel number of the closure of Pd if u is the Godel number 
of a wf Pd. First, let V(u) = ^ t ,< H (EVbl(2 1 ’) A Fr(w, i>)). V is primitive 
recursive (or recursive). V (u) is the least Godel number of a free vari- 
able of it (if there are any). Let Sent(w) be Fml(w) A ~ , ( 3l7 )«,<« Fr ( M > «)• 
Sent is primitive recursive (or recursive). Sent(w) holds when and only 
when it is the Godel number of a sentence (i.e., a closed wf). Now let 



2 3 * 2 3 * 2 13 *2 V W *2 5 *t/*2 5 
u 


if Fml(w) A ->Sent(i<) 
otherwise 
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G is primitive recursive (or recursive). If u is the Go del number of a wf 38 
that is not a closed wf, then G(w) is the Godel number of (S/x)38, where x 
is the free variable of 3# that has the least Godel number. Otherwise, 
G(u) = u. Now, let 

H(w, 0) = G(u) 

H(m, y+l) = G(H(a, y)) 


197 


H is primitive recursive (or recursive). Finally, 

Closed = H (u, fiy } ,^„(H(u, y) = H (u, y + l))) 

Thus, Clos is primitive recursive (or recursive). 


PROPOSITION 3.27 

Let K be a theory having a primitive recursive (or recursive) vocabulary and 
whose language contains the individual constant 0 and the function letter // 
of ■ (Thus, all the numerals are terms of K. In particular, K can be .S’ 
itself.) Then the following functions and relation are primitive recursive (or 
recursive) . 

(17) Num(y): the Godel number of the expression y 

Num(0) = 2 15 

Num(y + 1) — 2 49 * 2 3 * Num(y) * 2 5 

Num is primitive recursive by virtue of the recursion rule (V). 

(18) Nu(x) : x is the Godel number of a numeral 

= Num(y)) 

Nu is primitive recursive by Proposition 3.18. 

(19) D(w): the Godel number of if u is the Godel number of a wf 

&{x\)\ 

D(«) = Sub(t/, Num(id, 21) 

Thus, D is primitive recursive (or recursive). D is called the diagonal 
function. 


DEFINITION 

A theory K will be said to have a primitive recursive (or recursive ) axiom set 
if the following property PrAx is primitive recursive (or recursive): 

PrAx(y) : y is the Godel number of a proper axiom of K 

Notice that S has a primitive recursive axiom set. Let a \ , «2» • • • , be the 
Godel numbers of axioms (SI)- (S8). It is easy to see that a number v is the 
Godel number of an instance of axiom schema (A9) if and only if 




FORMAL NUMBER THEORY 


198 

(Bt;) l)<J ,(3w) u><> ,(EVbl(2 1 ') A Fml( w ) 

Ay = 2 3 * Sub(w, 2 15 , v) * 2 U * 2 3 * 2 3 * 2 3 * 2 13 * T * 2 5 

* 2 3 * vv * 2 n * Sub(vv, 2 49 * 2 3 * 2" * 2 5 , v) * 2 5 * 2 5 * 2 11 

* 2 3 * 2 3 * 2 13 * 2 V * 2 5 * >v * 2 5 * 2 5 ) 

Denote the displayed formula by Ag (y) . Then y is the Godel number of a 
proper axiom of S if and only if 

y ~ a\ Vy = a 2 V . . . Vy = a 8 V A 9 (y) 

Thus, PrAx(j’) is primitive recursive for S. 


PROPOSITION 3.28 

Let K be a theory having a primitive recursive (or recursive) vocabulary and 
a primitive recursive (or recursive) axiom set. Then the following three re- 
lations are primitive recursive (or recursive). 

(20) Ax(y): y is the Godel number of an axiom of K: 

LAX(y) V PrAx(y) 

(21) Prf(y): y is the Godel number of a proof in K: 

( 3t 0 lf<> ,(3r) l)<> ,(3z) 2< _ > ,(3w) w<3 ,([v = 2"’ A Ax(vv)] V 

[Prf(w) A Fml((w)J A y = u* 2 V A Gen((t/) tl „ v )] V 

[Prf(u) A Fml((«) z ) A Fml ((i/)J Ay = u*2 v A MP((w) z , («)„„ «)] 

V [Prf(t/) Ay — u * 2 V A Ax(u)] 

Apply Corollary 3.21. 

(22) Pf(y,x): y is the Godel number of a proof in K of the wf with Godel 
number x 

Prf(y) Ax = {y) f/Hy) + l 

The relations and functions of Propositions 3.26 3.28 should have the 
subscript ‘K’ attached to the corresponding signs to indicate the dependence 
on K. If we considered a different theory, then we would obtain different 
relations and functions. 


Exercise 

3.35 

(a) If K is a theory for which the property Fml(y) is primitive recursive (or 
recursive), prove that K has a primitive recursive (or recursive) voca- 
bulary. 
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(b) Let K be a theory for which the property Ax(y) is primitive recursive (or 
recursive). 

(i) Show that K has a primitive recursive (or recursive) vocabulary. 

(ii) Assuming also that no proper axiom of K is a logical axiom, prove 
that K has a primitive recursive (or recursive) axiom set. 
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PROPOSITION 3.29 

Let K be a theory with equality whose language contains the individual 
constant 0 and the function letter // and such that K has a primitive re- 
cursive (or recursive) vocabulary and axiom set. Also assume: 

(*) For any natural numbers r and s, if Ek f = s, then r = s. 

Then any function f{x i, . . . , x „ ) that is representable in K is recursive. 


Proof 

Let 0&(xi % . . . ,x„,x„ + i) be a wf of K that represents /. Let 
P${u \ , . . • , w H , u n +uy) mean that y is the Godel number of a proof in K of the 
wf i, . . . , F/ h ,F/ w+ i). Note that, if P^(«i, • • • , u„, w,,+i,y), then /(«i, . . . , 
u n ) — u n +i- (In fact, let /(iq, . . . , u„) — r. Since & represents / in 
K, \~k ®(u u . . . , w„,r) and Ek (3iy)^?(wi, . . . , u„,y). By hypothesis, P@(u\, 
. . . , M n+ i,p). Hence, Ek ^(F/i, . . . , u„ y u„+\). Since K is a theory with 
equality, it follows that E K r — u n+ 1 . By (*), r — u n+ \.) Now let m be the 
Godel number of &8{x\ , . . . , x„ , x„ + 1 ) . Then P@(u \, . . . , w M , u„ + \,y) is equiva- 
lent to: 

Pf(y, Sub(. . .Sub(Sub(m,Num(tu),21),Num(w2),29) . . .Num(u, I+ i),21 + 8w)) 

So, by Propositions 3.26-3.28, P@(ui , . . . ,« n ,M w+ i,_y) is primitive recursive 
(or recursive). Now consider any natural numbers k\, ... ,k„. Let 
f(kiy . . . ,k n ) = r. Then Ek t%(ki, ... ,k„,f). Let j be the Godel number of a 
proof in K of &(k\, . . . ,&„,r). Then P@(k\, . . . ,k n ,r,j). Thus, for any 
* 1 , . . . , x „ , there is some y such that P@(x \ , . . . ,x„, (y) 0 , (y),). Then, by Ex- 
ercise 3.16(c), iiy(P$(x [,. . . ,x„, (y) 0 , (y)j)) is recursive. But, f{x\ 

= [py{Pa{x i, . . • ,x„, (y) 0 , (j’)i))) 0 and > therefore, / is recursive. 


Exercise 

3.36 Let K be a theory whose language contains the predicate letter = , the 
individual constant 0, and the function letter //. 

(a)If K satisfies hypothesis (*) of Proposition 3.29, prove that K must be 
consistent. 
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(b) If K is inconsistent, prove that every number- theoretic function is re- 
presentable in K. 

(c) If K is consistent and the identity relation x — y is expressible in K, show" 
that K satisfies hypothesis (*) of Proposition 3.29. 


COROLLARY 3.30 

Assume S consistent. Then the class of recursive functions is identical with 
the class of functions representable in S. 


Proof 

We have observed that S has a primitive recursive vocabulary and axiom set. 
By Exercise 3.36(c) and the already noted fact that the identity relation is 
expressible in S, we see that Proposition 3.29 entails that every function 
representable in S is recursive. On the other hand, Proposition 3.24 tells us 
that every recursive function is representable in S. 

In Chapter 5, it will be made plausible that the notion of recursive 
function is a precise mathematical equivalent of the intuitive idea of effec- 
tively computable function. 


COROLLARY 3.31 

A number- theoretic relation R(x \, . . . ,x„) is recursive if and only if it is 
expressible in S. 


Proof 

By definition, R is recursive if and only if Cr is recursive. By Corollary 3.30, 
Cr is recursive if and only if Cr is representable in S. But, by Proposition 
3.13, C R is representable in S if and only if R is expressible in S. 

It will be helpful later to find weaker theories than S for which the 
representable functions are identical with the recursive functions. Analysis 
of the proof of Proposition 3.24 leads us to the following theory. 


Robinson’s System 

Consider the theory in the language SC A with the following finite list of 
proper axioms. 

(l)xi — Xy 

{2)x\ — x 2 => x 2 =xi 


i 
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( 3 ) A'l = X 2 => (x 2 — x 3 A'i = X 3 ) 

(4) X \ - — X2 Xj — Xj 

( 5 ) XI = X 2 =>- (*[ + X3 = X 2 + X 3 A A'3 +A'i = X 3 + A 2 ) 

(6) XI = x 2 => (xi • x 3 = x 2 • x 3 A x 3 • X\ = x 3 • x 2 ) 

11 ) x\ = x ' 2 =>• x\ = x 2 

(8) o ± x\ 

( 9 ) xi 7^ 0 => ( 3 a 2 )(xi = x^) 

(10) xi + 0 — x\ 

(11) Xi +X 2 = (xi + x 2 )' 

(12) xi • 0 = 0 

( 13 ) Xi • x J 2 = (xi - X 2 ) + A'i 

(14) (x 2 = xi • X3 + X4 A a'4 < xi a x 2 = xi • X5 + X& A x^ < xi ) 
x 4 = x 6 (uniqueness of remainder) 
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We shall call this theory RR. Clearly, RR is a subtheory of S, since all the 
axioms of RR are theorems of S. In addition, it follows from Proposition 
2.25 and axioms (l)-(6) that RR is a theory with equality. (The system Q of 
axioms (1) — (13) is due to R.M. Robinson (1950). Axiom (14) has been added 
to make one of the proofs below easier.) Notice that RR has only a finite 
number of proper axioms. 


LEMMA 3.32 

In RR, the following are theorems. 

(a) h+m — n + m for any natural numbers n and m 

(b) n • m — n ■ m for any natural numbers n and m 

(c) h ^ m for any natural numbers such that n ^ m 

(d) n < m for any natural numbers n and m such that n < m 

(e) x / 0 

(f) x^n => x = 0 Vx = I V . . . Vx = n for any natural number n 

(g) x^n Vrr^x for any natural number n 


Proof 

Parts (a)-(c) are proved the same way as Proposition 3.6(a). Parts (d)-(g) are 
left as exercises. 


PROPOSITION 3.33 


All recursive functions are representable in RR. 
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Proof 

The initial functions Z, N, and Uf are representable in RR by the same wfs 
as in Examples 1-3, pagel72. That the substitution rule does not lead out of 
the class of functions representable in RR is proved in the same way as in 
Example 4 on page 172. For the recursion rule, first notice that the proof of 
Lemma 3.22 is a demonstration that Godel’s beta function /?(jci , jc 2 , jc 3 ) i§ 
strongly representable in RR. (Axiom (14) is used for the uniqueness part.) 
Now, a careful examination of the treatment of the recursion rule in the 
proof of Proposition 3.24 reveals that all the required theorems are the- 
orems of RR. The argument given for the restricted /i-operator rule 
also remains valid for RR. 

By Proposition 3.33, all recursive functions are representable in any ex- 
tension of RR. Hence, by Proposition 3.29 and Exercise 3.36(c), in any 
consistent extension of RR in the language £?a that has a recursive axiom 
set, the class of representable functions is the same as the class of recursive 
functions. Moreover, by Proposition 3.13, the relations expressible in such a 
theory are the recursive relations. 

£xercises 

3.37 d Show that RR is a proper subtheory of S. [Hint: Find a model for RR 
that is not a model for S.] ( Remark : Not only is S different from RR, but it is 
not finitely axiomatizable at all, that is, there is no theory K having only a 
finite number of proper axioms, whose theorems are the same as those of S. 
This was proved by Ryll-Nardzewski (1953).) 

3.38 Show that axiom (14) of RR is not provable from axioms (1)— (13) 
and, therefore, that Q is a proper subtheory of RR. [Hint: Find a model of 
( 1 )— ( 1 3) for which (14) is not true.] 

3.39 Let K be a theory in the language ££ & with just one proper axiom; 
(Vxi)(Vx 2 )xi = X2. 

(a) Show that K is a consistent theory with equality. 

(b) Prove that all number-theoretic functions are representable in K. 

(c) Which number-theoretic relations are expressible in K? [Hint: Use 
elimination of quantifiers.] 

(d) Show that the hypothesis 0 ^ 1 cannot be eliminated from Propo- 
sition 3.13. 

(e) Show that, in Proposition 3.29, the hypothesis (*) cannot be replaced by 
the assumption that K is consistent. 

3.40 Let R be the theory in the language <£ a having as proper axioms the 
equality axioms (l)-(6) of RR as well as the following five axiom schemas, in 
which n and m are arbitrary natural numbers: 

(Rl) n + m = n T tn 
(R2) n ■ m = n • m 
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* (R3) n / m if n / m 
(R4) x^n x = 0 V . . . V x = n 
|(R5) x^n V n^x 

Prove the following. 

(a) R is not finitely axiomatizable. [Hint: Show that every finite subset of 
the axioms of R has a model that is not a model of R.] 

(b) R is a proper subtheory of Q. 

(c) D Every recursive function is representable in R. (See Monk, 1976, p. 248.) 

(d) The functions representable in R are the recursive functions. 

(e) The relations expressible in R are the recursive relations. 

3.5 THE FIXED-POINT THEOREM. GODEL’S INCOMPLETENESS 
THEOREM 

If K is a theory in the language J£a, recall that the diagonal function D has 
the property that, if u is the Godel number of a wf 5${x \ ), then D(w) is the 
Godel number of the wf &{u). 

NOTATION 

When £ is an expression of a theory and the Godel number of £ is q, then 
we shall denote the numeral q by We can think of r< ^ n as being a 

‘name’ for £ within the language ££ a- 

PROPOSITION 3.34 (DIAGONALIZATION LEMMA) 

Assume that the diagonal function D is representable in a theory with 
equality K in the language J£ A . Then, for any wf $(x\) in which X\ is the 
only free variable, there exists a closed wf £ such that 

Proof 

Let @(x\,X 2 ) be a wf representing D in K. Construct the wf 

(V) (Vx 2 )(^(x 1 ,x 2 ) => S{x 2 )) 

Let m be the Godel number of (V). Now substitute m for xi in (V): 

(V) (V* 2 )(S>(/7i,x 2 ) => S{x 2 )) 
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Let q be the Godel number of this wf ( S. So, qis r ^ 7 _i . Clearly, D(m) — q. (I n 
fact, m is the Godel number of a wf £%(x \ ) , namely, (V), and q is the Godel 
number of $(m).) Since <2) represents D in K, 

(d) b K q) 

(a) Let us show Fk # => &(q)- 


i. <e 

Hyp 

2. (Vx 2 )(®(w,x 2 ) =» ^(* 2 )) 

Same as 1 

3. <2>(m,q) => ^(q)) 

2, rule A 4 

4. &)(m,q) 

(d) 

5. £{q) 

3, 4, MP 

6. ^ Fk $(q) 

1-5 

7. h K & => £{q) 

1-6, Corollary 2.6 

(b) Let us prove Fk ${q) => 

1. S(q) 

Hyp 

2. &>(m,x 2 ) 

Hyp 

3. (3ix 2 )^(m,x 2 ) 

<£> represents D 

4. Q) («? , q) 

(d) 

II 

s 

2-4, properties of = 

6. S(x 2 ) 

1, 5, substitutivity of 

7. &'(q),@(m,x 2 ) F k <?(x 2 ) 

1-6 

8. ${q) Fk f^(w,x 2 ) =>- <^(x 2 ) 

1—7, Corollary 2.6 

9. £{q) F k (Vx 2 )(^(m,x 2 ) S‘{x 2 )) 

8, Gen 

10. Fk <%(q) => (Vx 2 )(^(m,x 2 ) => <^(x 2 )) 

1-9, Corollary 2.6 

11. F k ${q) 

Same as 10 


From parts (a) and (b), by biconditional introduction, Fk ^ ^ <%(q)- 


PROPOSITION 3.35 (FIXED-POINT THEOREM)! 

Assume that all recursive functions are representable in a theory with 
equality IC in the language S£ a- Then, for any wf <f(xi) in which xj is the 
only free variable, there is a closed wf ^ such that 

\- K <g<=>£( r <g n ) 


tThe terms ‘fixed-point theorem’ and ‘diagonalization lemma’ are often used 
interchangeably, but I have adopted the present terminology for convenience of 
reference. The central idea seems to have first received explicit mention by Carnap 
(1934), who pointed out that the result was implicit in the work of Godel (1931). The 
use of indirect self-reference was the key idea in the explosion of progress in math- 
ematical logic that began in the 1 930s 
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Proof 


By Proposition 3.27, D is recursive.! Hence, D is representable in K and 
proposition 3.34 is applicable. 


By Proposition 3.33, the fixed-point theorem holds when K is RR or any 
extension of RR. In particular, it holds for S. 


definitions 

Let K be any theory whose language contains the individual constant 0 and 
the function letter //. Then K is said to be co-consistent if, for every wf $ (x) 
of K containing x as its only free variable, if h K ~d8(n) for every natural 
number n, then it is not the case that h K (3x)^?(x). 

Let K be any theory in the language ££ A . K is said to be a true theory if all 
proper axioms of K are true in the standard model. (Since all logical axioms 
are true in all models and MP and Gen lead from wfs true in a model to wfs 
true in that model, all theorems of a true theory will be true in the standard 
model.) 

Any true theory K must be co-consistent. (In fact, if h K ~>$(h) for all 
natural numbers n, then ,#(x) is false for every natural number and, 
therefore, (zbc)^(x) cannot be true for the standard model. Hence, 
(Ebc)^(x) cannot be a theorem of K.) In particular, RR and S are co- 
consistent. 


PROPOSITION 3.36 

If K is co-consistent, then K is consistent. 


Proof 

Let <o (x) be any wf containing x as its only free variable. Let &(x) be 
£‘{x) A -*${x). Then ->£S(h) is an instance of a tautology. Hence, “>$?(«) 
for every natural number n. By co-consistency, not-F^ (zbt)^(x). Therefore, 
K is consistent. (Remember that every wf is provable in an inconsistent 


fin fact, D is primitive recursive, since K, being a theory in has a primitive 
recursive vocabulary. 
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theory, by virtue of the tautology -*A =>■ (A => B). Hence, if at least one wf is 
not provable, the theory must be consistent.) 


It will turn out later that the converse of Proposition 3.36 does not hold. 


DEFINITION 

An undecidable sentence of a theory K is a closed wf fA of K such that neither 
$ nor is a theorem of K, that is, such that not-bK and not-b K 


Godel’s incompleteness theorem 

Let K be a theory with equality in the language S3 a satisfying the following 
three conditions: 

1. K has a recursive axiom set (that is, PrAx(j-) is recursive). 

2. Hk 0^1. 

3. Every recursive function is representable in K. 

By assumption 1, Propositions 3.26-3.28 are applicable. By assumptions 2 
and 3 and Proposition 3.13, every recursive relation is expressible in K. By 
assumption 3, the fixed-point theorem is applicable. Note that K can be 
taken to be RR, S, or, more generally, any extension of RR having a re- 
cursive axiom set. Recall that Pf(.y,x) means that y is the Godel number of a 
proof in K of a wf with Godel number x. By Proposition 3.28, Pf is re- 
cursive. Hence, Pf is expressible in K by a wf (x 2 , xi ) . Let <S(x\ ) be the wf 
(Vx 2 )-«^/(x2,xi). % the fixed-point theorem, there must be a closed wf 3 
such that 

s' 

($) \-K& ** (\/X 2 )^/(x 2 , r $ n ) 

Observe that, in terms of the standard interpretation, (Vx' 2 ) ->^/(x 2 , r 3 n ) 
says that there is no natural number that is the Godel number of a proof in 
K of the wf 3, which is equivalent to asserting that there is no proof in K of 
3. Hence, 3 is equivalent in K to an assertion that 3 is unprovable in K. In 
other words, 3 says ‘I am not provable in K\ This is an analogue of the liar 
paradox: T am lying’ (that is, T am not true’). However, although the liar 
paradox leads to a contradiction, Godel (1931) showed that 3 is an un- 
decidable sentence of K. We shall refer to 3 as a Godel sentence for K. 


PROPOSITION 3.37 (GODEL’S INCOMPLETENESS THEOREM) 


Let K satisfy conditions 1—3. Then: 
(a) If K is consistent, not-b^ 3. 
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(b) If K is o>-consistent, not-h^ -3§. 

Hence, if K is co-consistent, 3 is an undecidable sentence of K. 


Proof 

Let q be the Godel number of 3. 

(a) Assume E^ <3. Let r be the Godel number of a proof in K of 3. 

Then Pf(r, q). Hence, h t that is \- K S/f(r, r 3~ i ). But, from ($) 

above by biconditional elimination, Ek (Va' 2 ) -&f (x 2 f 3 1 ) . By rule A4, 
Ek (r/ ^ n )• Therefore, K is inconsistent. 

(b) Assume K is co-consistent and E K -33. From ($) by biconditional 
elimination, Ek ~^{fxf)^S/ > f[x 1 f3 '), which abbreviates to 

(*) E K (3x 2 )^/(x 2 , r ^) 

On the other hand, since K is co-consistent, Proposition 3.36 implies that K 
is consistent. But, Ek ~>3. Hence, not-EK 3, that is, there is no proof in K of 
3. So, Pf(/?, q) is false for every natural number n and, therefore, 
Ek ~'S3f(h, r 3 n ) for every natural number n. (Remember that r 3 n is q.) 
By co-consistency, not-EK (3x2)^f(x2, r 3 n ), contradicting (*). 


REMARKS 

Godel’s incompleteness theorem has been established for any theory with 
equality K in the language S3 a that satisfies conditions 1-3 above. Assume 
that K also satisfies the following condition: 

(+) K is a true theory. 

(In particular, K can be S or any subtheory of S.) Proposition 3.37(a) shows 
that, if K is consistent, 3 is not provable in IC. But, under the standard 
interpretation, 3 asserts its own unprovability in K. Therefore, 3 is true for 
the standard interpretation. 

Moreover, when K is a true theory, the following simple intuitive argu- 
ment can be given for the undecidability of 3 in K. 

(i) Assume Ek 3. Since Ek 3 (Vx 2 ) _, ^/(x 2 , r '^’ n ), it follows that 

E k (Vx 2 ) (x2-, r 3 i ). Since K is a true theory, ( Vjc 2 ) ->0*f {xj f 3^) is true 
for the standard interpretation. But this wf says that 3 is not provable in K, 
contradicting our original assumption. Hence, not-EK 3. 

(ii) Assume Ek -*3. Since E k3 3$ 3~*), 

Er -«(Vjc 2 ) ^/(* 2 , r ^ n ). So, Ek ( 3x2)^f(x2, r 3 n ). Since K is a tiue 
theory, this wf is true for the standard interpretation, that is, 3 is provable 
in K. This contradicts the result of (i). Hence, not-EK ~S3. 




FORMAL NUMBER THEORY 


208 


Exercises 

3.41 Let ^ be a Godel sentence for S. Let S g be the extension of S obtained 
by adding as a new axiom. Prove that, if S is consistent, then S g is 
consistent, but not oj-consistent. 

3.42 A theory K whose language has the individual constant 0 and func- 
tion letter f\ is said to be to- incomplete if there is a wf &{x) with one free 
variable x such that h K £'(n) for every natural number n, but it is not the 
case that Hk (Vx)^(x). If K is a consistent theory with equality in the lan- 
guage <g A and satisfies conditions 1-3 on page 206, show that K is co- 
incomplete. (In particular, RR and S are co-incomplete.) 

3.43 Let K be a theory whose language contains the individual constant 0 
and function letter f\ . Show that, if K is a consistent and co-inconsistent, 
then K is co-incomplete. 

3.44 Prove that S, as well as any consistent extension of S having a re- 
cursive axiom set, is not a scapegoat theory. (See page 87.) 

3.45 Show that there is an oj-consistent extension K of S such that K is not 
a true theory. [Hint: Use the fixed point theorem.] 

The Godel-Rosser incompleteness theorem 

The proof of undecidability of a Godel sentence r S required the assumption 
of co-consistency. We will now prove a result of Rosser (1936) showing that, 
at the cost of a slight increase in the complexity of the undecidable sentence, 
the assumption of co-consistency can be replaced by consistency. 

As before, let K be a theory with equality in the language ££ a satisfying 
conditions 1—3 on page 206. In addition, assume: 

4. Fk * = 0 V x = T V . . . V x — ft for every natural number n. 

5. h k x^ 7 i V n for every natural number n. 

Thus, K can be any extension of RR with a recursive axiom set. In parti- 
cular, K can be RR or S. 

Recall that, by Proposition 3.26 (14), Neg is a primitive recursive func- 
tion such that, if x is the Godel number of a wf 0, then Neg(x) is the Godel 
number of (->&). Since all recursive functions are representable in K, let 
,r^(x u x 2 ) be a wf that represents Neg in K. Now construct the following 
wf S{x\)\ 

(Vx 2 )(^/(x 2 ,xi) => (Vx 3 )(./%(x 1 ,X 3 ) =4* (3x 4 )(x 4 ^x2 A^Y(x 4 ,x 3 )))) 

By the fixed-point theorem, there is a closed wf 01 such that 

(*) l-K &<*&{ r 0T) 

01 is called a Rosser sentence for K. Notice what the intuitive meaning of 01 
is under the standard interpretation. 01 asserts that, if 0t has a proof in K, 
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say with Gcdel number x 2 , then has a proof in K with Godel number 
smaller than x 2 . This is a roundabout way for 01 to claim its own un- 
provability under the assumption of the consistency of K. 


PROPOSITION 3.38 (GODEL-ROS8ER THEOREM) 

Let K satisfy conditions 1-5. If K is consistent, then PA is an undeddable 
sentence of K. 


Proof 

Let p be the Godel number of 0t. Thus, r fA~ 1 is p. Let j be the Godel number 
of ~i0l. 

(a) Assume Fr 0t- Since Fr ^ S'( r M 1 ), biconditional elimination 
yields F K £( r P? 1 ), that is: 

Fk (Vx 2 )(^/(x 2 ,p) => (Vx 3 ){J r e^{p,Xi) => (3x 4 )(x 4 <x 2 A 0 /(x 4 , x 3 ) ) ) ) 

Let k be the Godel number of a proof in K. of PA. Then Pf (k,p) and, 
therefore, Fr PP/(k,p). Applying rule A4 to <P‘( r 0 ?~ l ), we obtain 

Fr PA/(k, p) => (Vx 3 )(./fA^(p, x 3 ) =» (3x 4 )(x 4 <& A ^/(x 4 , x 3 ))) 

So, by MP, 

(%) Fr (Vx 3 )(./!%( a x 3 ) => (3x 4 )(x 4 spk A ^/(x 4 , x 3 ))) 

Since j is the Godel number of -^0t, we have Neg(/?, j), and, therefore, 
Fr Megip, j). Applying rule A4 to (%), we obtain Fr j) =>■ (3x 4 ) 

(x 4 ^k A PAf (x 4 , j)). Hence, by MP, Fr (=bc 4 )(x 4 ^k A. ^/(x 4 , j)), which is an 
abbreviation for 

(#) Fr ->(Vx 4 )-'(x 4 A ^/(X 4 , j)) 

Since Fr Pft, the consistency of K implies not- Fr -*0t. Hence, Pf(«, j) is false 
for all natural numbers n. Therefore, Fr j) for all natural numbers 

n. Since K is a theory with equality, Fr x 4 = n =>- -<^y(x 4 , j) for all natural 
numbers n . By condition 4, 

Fr x 4 =4* x 4 = 0 Vx 4 = 1 V . . . V x 4 = k 


Fr x 4 = n => -&/(x 4 , j) for n = 0, 1 , . . . , k 

So, by a suitable tautology, ( <f) and ( § f) yield Fr x 4 ^k => ->0 > /(x4 i J) 
and then, by another tautology, F K -i(x 4 ^k A ^/(x 4 , 7)). By Gen, 
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E K ( Vx 4 ) -> (x 4 A S 1 / (x 4 ,j)). This, together with (#), contradicts the - 

consistency of K. 

(b) Assume Ek -'St. Let m be the Godel number of a proof of -iSt in K 
So, Pf(m, j) is true and, therefore, E K S > /(in, /). Hence, by an application of 
rule E4 and the deduction theorem, E K m^x 2 =>■ ( 3 x 4 ) (x 4 5 ^X 2 A#/(x 4) ]))_ 
By consistency of K, not-b-K St and, therefore, Pf(w, p) is false for all 
natural numbers n. Hence, E K -»S > /(n i p) for all natural numbers 
n. By condition 4, Hr X 2 ^m =*► x 2 = 0 V X 2 = I V . . . V x 2 = in. Hence, 
E K X 2 ^m => —>S ) / (x' 2 , j?) - Consider the following derivation. 


L S > /(x 2 ,p) 

Hyp 

2 . ~Yey(p, x 3 ) 

Hyp 

3. x 2 ^th V m^x 2 

Condition 5 

4. in ^x 2 =b ( 3 .X 4 ) (*4 A S’/ (x 4 , /)) 

Proved above 

5. x 2 ^m^ ->S > /(x 2 , p) 

Proved above 

6 . ->S > /(x 2 , p) V (3x 4 )(x4 ^x 2 A S > /(x4, j)) 3-5, tautology 

7. (3x 4 )(x 4 ^X2 A S/(X 4 , j )) 

1 , 6 , disjunction rule 

8 . j\ r eg{p, j) 

Proved in part (a) 

9. ( 3 ix 3 pV^(p t X3) 

Jfeg. represents Neg 

10. X3 = j 

2, 8 , 9, properties of = 

1 1 . ( 3 x 4 ) (jX4 ^X 2 A Sf (X4 5X3)) 

7, 10, substitutivity of = 

12. S*/(x 2, P), AVe#(p, x 3 ) Ek ( 3 x 4 ) 

1-11 

(X4 ^X 2 A S > /(x 4, X 3 )) 


13. S>/(x 2 , p) Ek * 3 ) 

1-12, Corollary 2.6 

=> ( 3 x 4 )(x 4 ^x 2 a S > /(X4, x 3 )) 


14. S > /(x 2 ,p) Ek (Vx 3 )(^l %(p, x 3 ) 

13, Gen 

=> ( 3 x 4 )(x 4 ^X 2 A ^/(X4, X 3 ))) 


15. E k SPf{x 2 , p) =>- (Vx 3 )(./%(p, x 3 ) 

1-14, Corollary 2.6 

( 3 x 4 ) (X4 ^X 2 A S ) /(x4, x 3 ))) 


16. Ek (Vx 2 )(^/(x 2 , p) ^(Vx 3 ) 

1 5, Gen 

{Mp{p, x 3 ) =b ( 3 x 4 )(x 4 ^a- 2 A 


S>/(X 4 , x 3 )))) 


17. E k St 

(*1, biconditional elimination) 


Thus, Ek St and Ek -'St, contradicting the consistency of K. 

The Godel and Rosser sentences for the theory S are undecidable sen- 
tences of S. They have a certain intuitive metamathematical meaning; for 
example, a Godel sentence 2? asserts that 2? is unprovable in S. Until re- 
cently, no undecidable sentences of S were known that had intrinsic math- 
ematical interest. However, in 1977, a mathematically significant sentence of 
combinatorics, related to the so-called finite Ramsey theorem, was shown to 
be undecidable in S (see Kirby and Paris, 1977; Paris and Harrington, 1977; 
and Paris, 1978). 
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PEFINITION 

A theory K is said to be recursively axiomatizable if there is a theory K* 
having the same theorems as K such that K* has a recursive axiom set. 

COROLLARY 3.39 

Let K be a theory in the language ££ a- If K is a consistent, recursively 
axiomatizable extension of RR, then K has an undecidable sentence. 

Proof 

Let K* be a theory having the same theorems as K and such that K* has a 
recursive axiom set. Conditions 1-5 of Proposition 3.38 hold for K*. Hence, 
a Rosser sentence for K* is undecidable in K* and, therefore, also un- 
decidable in K. 

An effectively decidable set of objects is a set for which there is a me- 
chanical procedure that determines, for any given object, whether or not 
that object belongs to the set. By a mechanical procedure we mean a pro- 
cedure that is carried out automatically without any need for originality or 
ingenuity in its application. On the other hand, a set A of natural numbers is 
said to be recursive if the property x G A is recursive.* The reader should be 
convinced after Chapter 5 that the precise notion of recursive set corresponds 
to the intuitive idea of an effectively decidable set of natural numbers. This 
hypothesis is known as Church’s thesis. 

Remember that a theory is said to be axiomatic if the set of its axioms is 
effectively decidable. Clearly, the set of axioms is effectively decidable if and 
only if the set of Godel numbers of axioms is effectively decidable (since we 
can pass effectively from a wf to its Godel number and, conversely, from the 
Godel number to the wf). Hence, if we accept Church’s thesis, to say that K 
has a recursive axiom set is equivalent to saying that K is an axiomatic 
theory, and, therefore. Corollary 3.39 shows RR is essentially incomplete , 
that is, that every consistent axiomatic extension of RR has an undecidable 
sentence. This result is very disturbing; it tells us that there is no complete 
axiomatization of arithmetic, that is, there is no way to set up an axiom 
system on the basis of which we can decide all problems of number theory. 

Exercises 

3.46 Church’s thesis is usually taken in the form that a number-theoretic 
function is effectively computable if and only if it is recursive. Prove that this is 
equivalent to the form of Church’s thesis given above. 

*To say that x 6 A is recursive means that the characteristic function Ca is a 
recursive function, where Q(x) = 0 if x 6 A and Ca{x) — 1 if x f A (see page 180). 
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3.47 Let IC be a true theory that satisfies the hypotheses of the GodeL 
Rosser theorem. Determine whether a Rosser sentence for K is true for 
the standard interpretation. 

3.48 (Church, 1936b) Let Tr be the set of Godel numbers of all wfs in the 
language T£ A that are true for the standard interpretation. Prove that Tr is 
not recursive. (Hence, under the assumption of Church’s thesis, there is no 
effective procedure for determining the truth or falsity of arbitrary sentences 
of arithmetic.) 

3.49 Prove that there is no recursively axiomatizable theory that has Tr as 
the set of Godel numbers of its theorems. 

3.50 Let K be a theory with equality in the language A that satisfies 
conditions 4 and 5 on page 208. If every recursive relation is expressible in 
K, prove that every recursive function is representable in K. 


GodeFs Second Theorem 

Let K be an extension of S in the language T£ A such that K has a recursive 
axiom set. Let be the following closed wf of K: 

(Vxi ) ( V*2 ) ( V*3 ) ( Vx 4 ) -I {2Pf (*1 , x 3 ) A S?/(x 2 , x 4 ) A J r e?(x 3, x 4 )) 

For the standard interpretation, asserts that there are no proofs in K 

of a wf and its negation, that is, that K is consistent. 

Consider the following sentence: 

( 6 ) ty>09iYL =7* 

where ^ is a Godel sentence for K. Remember that 3 asserts that 3 is 
improvable in K. Hence, (G) states that, if IC is consistent, then 3 is not 
provable in IC. But that is just the first half of Godel’s incompleteness 
theorem. The metamathematical reasoning used in the proof of that theo- 
rem can be expressed and carried through within IC itself, so that one ob- 
tains a proof in IC of (G) (see Hilbert & Bernays, 1939, pp. 285-328; 
Feferman, 1960). Thus, \~k => 3. But, by Godel’s incompleteness 

theorem, if K is consistent, 3 is not provable in K. Hence, if K is consistent, 
is not provable in K. 

This is Godel 's second theorem (1931). One can paraphrase it by stating 
that, if K is consistent, then the consistency of K cannot be proved within IC, 
or, equivalently, a consistency proof of K must use ideas and methods that 
go beyond those available in K. Consistency proofs for S have been given by 
Gentzen (1936; 1938) and Schutte (1951), and these proofs do, in fact, 
employ notions and methods (for example, a portion of the theory of de- 
numerable ordinal numbers) that apparently are not formalizable in S. 

Godel’s second theorem is sometimes stated in the form that, if a ‘suffi- 
ciently strong’ theory K is consistent, then the consistency of IC cannot be 




THE FIXED-POINT THEOREM. GODEL’S incompleteness theorem 

proved within K. Aside from the vagueness of the ‘sufficiently strong’ (which 
tan be made precise without much difficulty), the way in which the con- 
sistency of K is formulated is crucial. Feferman (1960, Cor. 5.10) has shown 
*tfiat there is a way of formalizing the consistency of S - say, ^Wg - such 
-that bs ^Wg. A precise formulation of Godel second theorem may be found 
In Feferman (1960). (See Jeroslow (1971; 1972; 1973) for further clarification 
and development.) 

In their proof of Godel’s second theorem, Hilbert and Bernays (1939) 
'based their work on three so-called derivability conditions. For the sake of 
definiteness, we shall limit ourselves to the theory S, although everything we 
say also holds for recursively axiomatizable extensions of S. To formulate 
the Hilbert-Bernays results, let 38 eev(x \ ) stand for (3x2)3?/ (xz, xi). Thus, 
under the standard interpretation, 3ftea>(x \ ) means that there is a proof in S 
of the wf with Godel number xi ; that is, the wf with Godel number xi is 
provable in S.* Notice that a Godel sentence ^ for S satisfies the fixed-point 
condition: bs o ->3$ ee#( r ^ ) . 
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THE HILBERT-BERNAYS DERIVABILITY CONDITIONS* 

(HB1) If b s then b s 38 e^^). 

(HB2) b S 3Sea>( r V => =b (38eu>( r ^) =b 
(HB3) bs 38e^(/ c 3~^) =b 38 eu>^ 38 ) 

Here, and 38 are. arbitrary closed wfs of S. (HB1) is straightforward and 
(HB2) is an easy consequence of properties of 38/. However, (HB3) requires 
a careful and difficult proof. (A clear treatment may also be found in Boolos 
(1993, chap. 2), and in Shoenfield (1967, pp. 211—213).) 

A Godel sentence ^ for S asserts its own unprovability in S: 
bs ^ bb ->3$£^{ r y 1 ). We also can apply the fixed-point theorem to obtain 
a sentence Jf such that bs #8 <*b 38eev(/3tf > ~ [ ). M is called a Henkin sentence 
for S. -38 asserts its own provability in S. On intuitive grounds, it is not clear 
whether 38 is true for the standard interpretation, nor is it easy to determine 
whether 38 is provable, disprovable or undecidable in S. The problem was 
solved by Lob (1955) on the basis of Proposition 3.40 below. First, however, 
let us introduce the following convenient abbreviation. 


+ ‘Bew’ consists of the first three letters of the German word beweisbar, which 
means ‘provable’. 

*These three conditions are simplifications by Lob (1955) of the original Hil- 
bert— Bernays conditions. 
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NOTATION 

Let □*? stand for where c € is any wf. Then the Hilbert-Bernays® 

derivability conditions become: 

(HB1) If h s % then h s d^- 

(HB 2 ) p s n(& =4 =4 (n% =4 n@) 

(HB3) h s JJ& =4 □ d^ 

The Godel sentence and the Henkin sentence satisfy the equivalences 

hs *3 44 and hs 44 d*^* 


PROPOSITION 3.40 (LOB’S THEOREM) 

Let ^ be a sentence of S. If hs d^ =4 then hs c €. 
Proof 


Apply the fixed-point theorem to the wf 38.eu>{x \ ) =4> W to obtain a sentence 
such that hs ££ 44 (^’^( r J2 ?n ) =4 ) . Thus, hs 44 (dJSf =4 #). Then 

we have the following derivation of ( €. 


1. h S ^ 44 (d^ =4 

Obtained above 

2. hs Jzf =4 {U& =4 <$) 

1, biconditional elimination 

3. hs U{& =4 (d^ -4 Sf)) 

2, (HB1) 

4. hs d^ =4 d(d^ =4 <#) 

3, (HB2), MP 

5. hs d(d^ =4 <#) =4 (d d^f =4 d^) 

(HB2) 

6. h s d^4(dD^4 d^) 

4, 5 tautology 

7. hs d^ =4 d d^ 

(HB3) 

8. hs d^ =4 d^ 

6, 7, tautology 

9. h s d^=4 ^ 

Hypothesis of the theorem 

10. hs d^ =4 ^ 

8, 9, tautology 

11. hs ^ 

1,10, biconditional elimination 

12. hs d^ 

11, (HB1) 

13. h s 

10, 12, MP 


COROLLARY 3.41 

Let be a Henkin sentence for S. Then hs and 2tf' is true for the 
standard interpretation. 

Proof 

h s JC 44 dJC. By biconditional elimination, hs =4 . So, by Lob’s 

theorem, hs Since M asserts that is provable in S, JC is true 
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Lob’s theorem also enables us to give a proof of Godel’s second theorem 
for S. 
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PROPOSITION 3.42 (GODEL’S SECOND THEOREM) 
If S is consistent, then not-h s <^Ws. 


froof 

Assume S consistent. Since hs 0 ^ T, the consistency of S implies not- 
ing o = 1. By Lob’s theorem, not-hs D(0 = 1) => 0 =1. Hence, by the 
tautology ~>A (A => fi), we have: 

(*) not- h s --□(0 = 1) 

But, since hs 0 ^ I, (HB1) yields hs 0(0 ^ T). Then it is easy to show that 
L s ^V^s -'□(0 = T). So, by (*), not-hs ‘^W-s* 

Boolos (1993) gives an elegant and extensive study of the fixed-point 
theorem and Lob’s theorem in the context of an axiomatic treatment of 
provability predicates. Such an axiomatic approach was first proposed and 
developed by Magari (1975). 


Exercises 

3.51 Prove (HB1) and (HB2). 

.3.52 Give the details of the proof of hs ^Ws =>• ->&eev( r 0 = l n ), which 
was used in the proof of Proposition 3.42. 

3.53 If ^ is a Godel sentence of S, prove hs ^ -^W>( r 0 = T n ). 

(Hence, any two Godel sentences for S are provably equivalent. This is an 
instance of a more general phenomenon of equivalence of fixed-point sen- 
tences, first noticed and verified independently by Bernardi (1975; 1976), De 
Jongh, and Sambin (1976). See Smorynski (1979; 1982).) 

3.54 In each of the following cases, apply the fixed-point theorem for S to 
obtain a sentence of the indicated kind; determine whether that sentence is 
provable in S, disprovable in S, or undecidable in S; and determine the truth 
or falsity of the sentence for the standard interpretation. 

(a) A sentence ^ that asserts its own decidability in S (that is, that hs or 
hs -i#). 

(b) A sentence that asserts its own undecidability in S. 

(c) A sentence asserting that not-hs 

(d) A sentence asserting that hs 
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3.6 RECURSIVE UNDECIDABILITY. CHURCH’S THEOREM 
If K is a theory, let 7k be the set of Godel numbers of theorems of K. 


DEFINITIONS 

K is said to be recursively decidable if 7 k is a recursive set (that is, the 
property x E 7k is recursive). K is said to be recursively imdecidable if 7L j s 
not recursive. K is said to be essentially recursively imdecidable if K and all 
consistent extensions of K are recursively undecidable. 

If we accept Church’s thesis, then recursive undecidability is equivalent to 
effective undecidability, that is, non-existence of a mechanical decision 
procedure for theoremhood. The non-existence of such a mechanical pro- 
cedure means that ingenuity is required for determining whether arbitrary 
wfs are theorems. 


Exercise 

3.55 Prove that an inconsistent theory having a recursive vocabulary is 
recursively decidable. 


PROPOSITION 3.43 

Let K be a consistent theory with equality in the language *£ A in which the 
diagonal function D is representable. Then the property x 6 7k is not ex- 
pressible in K. 


Proof 

Assume x E 7k is expressible in IC by a wf .T{x\). Thus: 

(a) If n 6 7k, F K f{n). 

(b) If n £ 7k, hc-^(n). 

By the diagonalization lemma applied to -i5 r (x 1 ), there is a sentence r d such 
that Hk ^ ^ Let q be the Godel number of ( S. So: 

(c) I-k ^ 

Case 1: Fk Then q 6 ?k. By (a), F K But, from Fk ^ and (c), by 

biconditional elimination, Fk _, ^"(<7)- Hence K is inconsistent, contradicting 
our hypothesis. 
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Case 2: not-l-K So, q 7k. By (b), h K ~'3T(q). Hence, by (c) and 
biconditional elimination, \~k < €. 

Thus, in either case a contradiction is reached. 
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DEFINITION 

A set B of natural numbers is said to be arithmetical if there is a wf 3ft {x) in 
the language !£ a, with one free variable x, such that, for every natural 
number «, n e B if and only if 38{h) is true for the standard interpretation. 


COROLLARY 3.44 [TARSKI’S THEOREM (1936)] 

Let Tr be the set of Godel numbers of wfs of S that are true for the standard 
interpretation. Then Tr is not arithmetical. 


Proof 

Let Jf be the extension of S that has as proper axioms all those wfs that are 
true for the standard interpretation. Since every theorem of J r must be true 
for the standard interpretation, the theorems of Jf are identical with the 
axioms of Jf. Hence, Tj- — Tr. Thus, for any closed wf 3S, 3ft holds for the 
standard interpretation if and only if h j- 3ft. It follows that a set B is ar- 
ithmetical if and only if the property x E B is expressible in J r . We may 
assume that Jf is consistent because it has the standard interpretation as a 
model. Since every recursive function is representable in S, every recursive 
function is representable in Jf and, therefore, D is representable in J r . By 
Proposition 3.43, x E Tr is not expressible in J r . Hence, Tr is not ar- 
ithmetical. (This result can be roughly paraphrased by saying that the no- 
tion of arithmetical truth is not arithmetically definable.) 


PROPOSITION 3.45 

Let K be a consistent theory with equality in the language Sf a in which all 
recursive functions are representable. Assume also that Hk 0 ^ T. Then K is 
recursively undecidable. 


Proof 

D is primitive recursive and, therefore, representable in K. By Proposition 
3.43, the property x e T K is not expressible in K. By Proposition 3.13, the 
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characteristic function Ct k is not representable in K. Hence, C Tk is not a 
recursive function. Therefore, 7k is not a recursive set and so, by definition 
K is recursively undecidable. 


COROLLARY 3.46 

RR is essentially recursively undecidable. 

Proof 

RR and all consistent extensions of RR satisfy the conditions on K in 
Proposition 3.45 and, therefore, are recursively undecidable. (We take for 
granted that RR is consistent because it has the standard interpretation as a 
model. More constructive consistency proofs can be given along the same 
lines as the proofs by Beth (1959, § 84) or Kleene (1952, § 79).) 

We shall now show how this result can be used to give another derivation 
of the Godel- Rosser theorem. 


PROPOSITION 3.47 

Let K be a theory with a recursive vocabulary. If K is recursively ax- 
iomatizable and recursively undecidable, then K. is incomplete (i.e., K has an 
undecidable sentence). 

Proof 

By the recursive axiomatizability of K, there is a theory J with a recursive 
axiom set that has the same theorems as K. Since K and J have the same 
theorems, 7k = 7j and, therefore, J is recursively undecidable, and K is 
incomplete if and only if J is incomplete. So, it suffices to prove J incomplete. 
Notice that, since K and J have the same theorems, J and K must have the 
same individual constants, function letters, and predicate letters (because all 
such symbols occur in logical axioms). Thus, the hypotheses of Propositions 
3.26 an 3.28 hold for J. Moreover, J is consistent, since an inconsistent 
theory with a recursive vocabulary is recursively decidable. 

Assume J is complete. Remember that, if x is the Godel number of a wf, 
Clos(x) is the Godel number of the closure of that wf. By Proposition 3.26 
(16), Clos is a recursive function. Define: 

H(x) = /m[(Fml(x) A (Pf(y, Clos(x)) V Pf(v, Neg(Clos(x))))) V -Fml(x)] 

Notice that, if x is not the Godel number of a wf, H(x) = 0. If x is the Godel 
number of a wf 88, the closure of 88 is a closed wf and, by the completeness 
of J, there is a proof in J of either the closure of 8ft or its negation. Hence, 
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jl(x) is obtained by a legitimate application of the restricted ^-operator and, 
therefore, H is a recursive function. Recall that a wf is provable if and only if 
its closure is provable. So, x E 7} if and only if Pf(H(x), Clos(x)). But 
pf(H(x), Clos(v)) is recursive. Thus, 7) is recursive, contradicting the re- 
cursive undecidability of J. 

The intuitive idea behind this proof is the following. Given any wf SB, we 
form its closure # and start listing all the theorems in J. (Since PrAx is 
recursive, Church’s thesis tells us that J is an axiomatic theory and, there- 
fore, by the argument on page 86, we have an effective procedure for gen- 
erating all the theorems.) If J is complete, either ^ or will eventually 
appear in the list of theorems. If appears, SB is a theorem. If -(& appears, 
then, by the consistency of J, ^ will not appear among the theorems and, 
therefore, SB is not a theorem. Thus, we have a decision procedure for 
theoremhood and, again by Church’s thesis, J would be recursively decidable. 


COROLLARY 3.48 (GODEL-ROSSER THEOREM) 

Any consistent recursively axiomatizable extension of RR has undecidable 
sentences. 

Proof 

This is an immediate consequence of Corollary 3.46 and Proposition 3.47. 
Exercises 

3.56 Prove that a recursively decidable theory must be recursively ax- 
iomatizable. 

3.57 Let K be any recursively axiomatizable true theory with equality. (So, 
7k Q Tr.) Prove that K has an undecidable sentence. [Hint: Use Proposition 
3.47 and Exercise 3.48.] 

3.58 Two sets A and B of natural numbers are said to be recursively in- 
separable if there is no recursive set C such that A C C and B C C. (C is the 
complement co — C .) Let K be any consistent theory with equality in the 
language SB a in which all recursive functions are representable and such that 
|- K 0 I . Let Refic be the set of Godel numbers of refutable wfs of K, that 
is, (x|Neg(x) E 7k}. Prove that 7k and Reffc are recursively inseparable. 


DEFINITIONS 

Let Ki and K 2 be two theories in the same language. 

(a) K 2 is called a finite extension of Kj if and only if there is a set A of wfs 
and a finite set B of wfs such that: (1) the theorems of Ki are precisely 
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the wfs derivable from A; and (2) the theorems of K 2 are precisely tie 
wfs derivable from AuB. 

(b) Let KiuK 2 denote the theory whose set of axioms is the union of the set 
of axioms of Ki and the set of axioms of K 2 . We say that Ki and K 2 are 
compatible if KiuK 2 is consistent. % 


PROPOSITION 3.49 

Let K! and K 2 be two theories in the same language. If K 2 is a finite 
extension of Ki and if IC 2 is recursively undecidable, then Ki is recursively 
undecidable. 

Proof 

Let A be a set of axioms of Ki and A\j{28 1 , . . . , &„} a set of axioms for K 2 . 
We may assume that are closed wfs. Then, by Corollary 2.7, it is 

easy to see that a wf ^ is provable in IC 2 if and only if A ... A 88 n ) =>■ j s 
provable in Kj. Let c be a Godel number of {t%\ A ... A Then b is a 
Godel number of a theorem of K 2 when and only when 2 3 * c * 2 11 *b* 2 s is 
a Godel number of a theorem of Ki; that is, b is in 7 k 2 if and only if 
2 3 * c * 2 11 * b* 2 5 is in 7 k, . Hence, if 7L, were recursive, 7k 2 would also be 
recursive, contradicting the recursive undecidability of K 2 . 


PROPOSITION 3.50 

Let K be a theory in the language S£ a • If K is compatible with RR, then K is 
recursively undecidable. 

Proof 

Since K is comptatible with RR, the theory KuRR is a consistent extension 
of RR. Therefore, by Corollary 3.46, KuRR is recursively undecidable. 
Since RR has a finite number of axioms, KuRR is a finite extension of K. 
Hence, by Proposition 3.49, K is recursively undecidable. 


COROLLARY 3.51 

Every true theory K is recursively undecidable. 

Proof 

IC U RR has the standard interpretation as a model and is, therefore, con- 
sistent. Thus, K is compatible with RR. Now apply Proposition 3.50. 
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COROLLARY 3.52 

let Ps be the predicate calculus in the language !£ A . Then Ps is recursively 
pndecidable. 

proof 

p s uRR - RR- Hence, Ps is compatible with RR and, therefore, by Pro- 
position 3.50, recursively undecidable. 

By PF we mean the full first-order predicate calculus containing all 
predicate letters, function letters and individual constants. Let PP be the 
pure first-order predicate calculus, containing all predicate letters but no 
function letters or individual constants. 


LEMMA 3.53 

"There is a recursive function h such that, for any wf 88 of PF having Godel 
tnumber u, there is a wf 8$ of PP having Godel number h(u) such that 88 is 
provable in PF if and only if 8$ is provable in PP. 


Proof 


jLet 88 be a wf of PF. With the distinct function letters fj' in 88, associate 
fdistinct predicate letters Af 1 not occurring in 88, and with the distinct 

k,. ***».* 1 

individual constants cij in 88, associate distinct predicate letters A k not oc- 
curring in 88. Find the first individual constant cij in 88 (if any). Let z be the 
first variable not in 88 and let 88* result from 88 by replacing all occurrences 
of ctj by z. Form the wf 88 \ : ( 3z)A\{z ) =£> (3z)(A^(z) A 8$*), where A\ is the 
predicate letter associated with cij. It is easy to check (see the proof of 
Proposition 2.28) that 88 is logically valid if and only if 88 \ is logically valid. 
Keep on performing similar transformations until a wf £ without individual 
constants is reached; then £ is logically valid if and only if 88 is logically 
valid. Next, take the leftmost term ... , t„ ) in £, where t\ , . . . , t„ do not 
contain function letters. Let w be the first variable not in £, let result 


from £ by replacing ff(ti,...,t„) by w, and let £\ be the wf 
(3w)4 ,+1 (w, t u . . . , t„) =>• (3w)(^! +1 (w, ri,...,r„) A £*), where A'* 1 is the 
predicate letter associated with It is easy to verify that £ is logically valid 
if and only if £\ is logically valid. Repeat the same transformation on £\, 
and so on, until a wf 88' is reached that contains no function letters. Then 88 ’ 
is a wf of PP, and 88' is logically valid if and only if 88 is logically valid. By 
Godel’s completeness theorem (Corollary 2.19), 88 is logically valid if and 
only if hpF 88, and 88' is logically valid if and only if hpp 88' . Now, if u is the 
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Godel number of SB, let h(u) be the Godel number of 0 . When u is not the" 
Godel number of a wf of PF, define h(u) to be 0. Clearly, h is effectively 
computable because we have described an effective procedure for obtaining 
0 from SB. Therefore, by Church’s thesis, h is recursive. Alternatively, an 
extremely diligent reader could avoid the use of Church’s thesis by ‘ ar 
ithmetizing’ all the steps described above in the computation of h. 


PROPOSITION 3.54 (CHURCH’S THEOREM (1936a)) 

PF and PP are recursively undecidable. 

Proof 

(a) By Godel’s completeness theorem, a wf SB of Ps is provable in Ps if and 
only if SB is logically valid, and SB is provable in PF if and only if SB i s 
logically valid. Hence, Fp s SB if and only if F P1 , SB. However, the set 
Fmlp s of Godel numbers of wfs of Ps is recursive. Then 
7p s := TppriFmlpg, where ^Ps and Tpp are, respectively, the sets of Godel 
numbers of the theorems of P s and PF. If 7pp were recursive, 7p s would 
be recursive, contradicting Corollary 3.52. Therefore, PF is recursively 
undecidable. 

(b) By Lemma 3.53, u is in 7p F if and only if h(u ) is in Tpp. Since h is 
recursive, the recursiveness of Tpp would imply the recursiveness of Tpp, 
contradicting (a). Thus, 7pp is not recursive; that is, PP is recursively 
undecidable. 

If we accept Church’s thesis, then ‘recursively undecidable’ can be re- 
placed everywhere by ‘effectively undecidable’. In particular, Proposition 
3.54 states that there is no decision procedure for recognizing theoremhood, 
either for the pure predicate calculus PP or the full predicate calculus PF. By 
Godel’s completeness theorem, this implies that there is no effective method 
for determining whether any given wf is logically valid. 

Exercises 

3.59° 

(a) By a wf of the pure monadic predicate calculus (PMP) we mean a wf of 
the pure predicate calculus that does not contain predicate letters of 
more than one argument. Show that, in contrast to Church’s theorem, 
there is an effective procedure for determining whether a wf of PMP is 
logically valid. [Hint: Let B \ , B 2 , ■ ■ ■ , be the distinct predicate letters in 
a wf SB. Then SB is logically valid if and only if SB is true for every 
interpretation with at most 2 k elements. (In fact, assume SB is true for 
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every interpretation with at most 2 k elements, and let M be any inter- 
pretation. For any elements b and c of the domain D of M, call b and c 
equivalent if the truth values of #i(£),i? 2 (£)> • • • ,Bk(b) in M are, re- 
spectively, the same as those of B\(c), ^(c), . . . ,2?*(c). This defines an 
equivalence relation in D, and the corresponding set of equivalence 
classes has at most 2 k members and can be made the domain of an 
interpretation M* by defining interpretations of Bi,...,Bk, in the ob- 
vious way, on the equivalence classes. By induction on the length of wfs 
that contain no predicate letters other than B\ , . . . , Bk, one can show 
that is true for M if and only if it is true for M* . Since £8 is true for 
M , it is also true for M. Hence, £8 is true for every interpretation.) Note 
also that whether £8 is true for every interpretation that has at most 2 k 
elements can be effectively determined.]* 

(b) Prove that a wf £8 of PMP is logically valid if and only if £8 is true for all 
finite interpretations. (This contrasts with the situation in the pure 
predicate calculus; see Exercise 2.56 on page 93.) 

3.60 If a theory K* is consistent, if every theorem of an essentially recur- 
sively undecidable theory Ki is a theorem of K*, and if the property 
FmlKi (> ; ) is recursive, prove that K* is essentially recursively undecidable. 

3.61 (Tarski, Mostowski and Robinson, 1953, I) 

(a) Let K be a theory with equality. If a predicate letter A'j, a function letter 
fj and an individual constant aj are not symbols of K, then by possible 
definitions of A'fifJ and aj in K we mean, respectively, expressions of the 
form 


(i) (Vxi ) . . . ( Vx„ ) (A" (a'i , . . . , x„) 

(ii) (Vxi) . . . (Vx„ ) ( Vy ) (fij (xi , . . . 

(iii) (V>’)(tf/ =y& @{y)) 


£8(x \, . . . ,x„)) 

— y yy tffa, . . . ,*„,>')) 


where ^ and £8 are wfs of K; moreover, in case (ii), we must also have 
he (Vxi) . . . (Vx„)(3it)^(xi, . . . ,x„, j), and, in case (iii), b K (3\y)@(y). If 
K is consistent, prove that addition of any possible definitions to K as 
new axioms (using only one possible definition for each symbol) yields a 
consistent theory K 7 , and K 7 is recursively undecidable if and only if K is. 
(b) By a non-logical constant we mean a predicate letter, function letter or 
individual constant. Let Ki be a theory with equality that has a finite 
number of non-logical constants. Then Ki is said to be interpretable in a 
theory with equality K if we can associate with each non-logical con- 
stant of Ki that is not a non-logical constant of K a possible definition 


223 


*The result in this exercise is, in a sense, the best possible. By a theorem of 
Kalmar (1936), there is an effective procedure producing for each wf £8 of the pure 
predicate calculus another wf Sfti of the pure predicate calculus such that '28i contains 
only one predicate letter, a binary one, and such that £8 is logically valid if and only if 
$2 is logically valid. (For another proof, see Church, 1956, § 47.) Hence, by Church’s 
theorem, there is no decision procedure for logical validity of wfs that contain only 
binary predicate letters. (For another proof, see Exercise 4.68 on page 271.) 
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in K such that, if K" is the theory obtained from K by adding 
possible definitions as axioms, then every axiom (and hence every 1 ' 
orem) of Ki is a theorem of K*. Notice that, if Ki is interpretable 
it is interpretable in every extension of K. Prove that, if Kj is i 
pretable in K and K is consistent, and if Ki is essentially recurs 
undecidable, then K is essentially recursively undecidable. 

3.62 Let K be a theory with equality and Aj a monadic predicate lettei 
in K. Given a closed wf let (called the relativization of To with rtv 
to Aj) be the wf obtained from # by replacing every subformula (stai 
from the smallest subformulas) of the form (Vx)^(x) by (Vx) (A 1 (x) ^ % 
Let the proper axioms of a new theory with equality TO 4 ' be: (i) all wfs U 


where is the closure of any proper axiom of K; (ii) {Ax) A 1 - (x); (ifi) 
for each individual constant a m of K; and (iv) A 1 (xj )a 
A) (x„) =*► A l j(fl l (x i, . . . ,x„)) for any function letter Jf of K. Prove the I 
lowing. 

(a) As proper axioms of KA we could have taken all wfs where 
the closure of any theorem of K. 

(b) KA is interpretable in K. 

(c) K A J is consistent if and only if K is consistent. 

(d) R a j is essentially recursively undecidable if and only if K is (Tars 
Mostowski and Robinson, 1953, pp. 27-28). 

3.63 K is said to be relatively interpret able in K' if there is some predics 
letter Aj not in K such that K A > is interpretable in K'. If K is relativi 
interpretable in a consistent theory K' and K is essentially recursively m 
decidable, prove that K r is essentially recursively undecidable. 

3.64 Call a theory K in which RR is relatively interpretable sufficien 
strong. Prove that any sufficiently strong consistent theory K is essential! 
recursively undecidable, and, if K is also recursively axiomatizable, pm 
that K is incomplete. Roughly speaking, we may say that K is sufficient 
strong if the notions of natural number, 0, l, addition and multiplication a r 
‘definable’ in K in such a way that the axioms of RR (relativized to t 
‘natural numbers’ of K) are provable in K. Clearly, any theory adequate 
present-day mathematics will be sufficiently strong and so, if it is consistent 
then it will be recursively undecidable and, if it is recursively axiomatizable 
then it will be incomplete. If we accept Church’s thesis, this implies that ar 
consistent sufficiently strong theory will be effectively undecidable and, if h 
is axiomatic, it will have undecidable sentences. (Similar results also hold ft 
higher-order theories; for example, see Godel, 1931.) This destroys all Ik 
for a consistent and complete axiom atization of mathematics. 
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1 AN axiom system 

prime reason for the increase in importance of mathematical logic in the 
•nfieth century was the discovery of the paradoxes of set theory and the 
‘cf for a revision of intuitive (and contradictory) set theory. Many dif- 
renf axiomatic theories have been proposed to serve as a foundation for 
t theory but, no matter how they may differ at the fringes, they all have as 
corhmon core the fundamental theorems that mathematicians require for 
l eir daily work. We make no claim about the superiority of the system we 
;.all use except that, from a notational and conceptual standpoint, it is a 
onvenicnt basis for present-day mathematics. 

We shall describe a first-order theory NBG, which is basically a system of 
he same type as one originally proposed by von Neumann (1925; 1928) and 
later Thoroughly revised and simplified by R. Robinson (1937), Bernays 
1937-1954); and Godel (1940) (We shall follow Godel’s monograph to a 
reat extent, although there will be some significant differences.) 

NBG has a single predicate letter A\ but no function letter, or individual 
constants.^ In order to conform to the notation in Bernays (1937—1954) and 
Godel (1940), we shall use capital italic letters X\, X 2 , A 3 , ... as variables 
instead of xi , xj , X 3 , ... . (As usual, we shall use X, Y, Z, ... to represent 
rbitrary variables.) We shall abbreviate Aj(X, Y) by X E Y, and -lAjiX, Y ) 
y X£Y. 

Intuitively, e is to be thought of as the membership relation and the 
values of the variables are to be thought of as classes. Classes are certain 
collections of objects. Some properties determine classes, in the sense that a 
property P may determine a class of all those objects that possess that 
property. This ‘interpretation’ is as imprecise as the notions of ‘collection’ 
■•nd ‘property’. The axioms will reveal more about what we have in mind. 
I hey will provide us with the classes we need in mathematics and appear 
modest enough so that contradictions are not derivable from them. 

'We use A\ instead of A\ because the latter was used previously for the equality 
elation. 
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Let us define equality in the following way. 


DEFINITION 


X — Y for (VZ)(Z <E X ^ Z <E T) f 


Thus, two classes are equal when and only when they have the same 
members. 


DEFINITIONS 

IC7 for (VZ)(Z (inclusion) 

let for ICfAl^y (proper inclusion) 

When X C Y, we say that X is a subclass of Y. When X C Y, we say that X is 
a proper subclass of Y. 

As easy consequences of these definitions, we have the following. 


PROPOSITION 4.11 

(a) Y X =Y &{X <ZY f\Y CX) 

(b) hX = X 

(c) hX= Y^ Y = X 

(d) hX=Y^(Y = Z^>X = Z) 

We shall now present the proper axioms of NBG, interspersing among 
the axioms some additional definitions and various consequences of the 
axioms. 

We shall define a class to be a set if it is a member of some class. Those 
classes that are not sets are called proper classes. 


DEFINITIONS 

U(X) for (3F)(X e Y) (X is a set) 

Pr(X) for ->M(X) (X is a proper class) 

It will be seen later that the usual derivations of the paradoxes now no 
longer lead to contradictions but only yield the results that various classes 
are proper classes, not sets. The sets are intended to be those safe, com- 
fortable classes that are used by mathematicians in their daily work, whereas 

^ As usual, Z is to be the first variable different from X and Y. 

*The subscript NBG will be omitted from in the rest of this chapter. 
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^proper classes are thought of as monstrously large collections that, if per- 
mitted to be sets (i.e., allowed to belong to other classes), would engender 

contradictions. 


Exercise 4.1 Prove h X E Y =>■ M(X). 

The system NBG is designed to handle classes, not concrete individuals.! 
The reason for this is that mathematics has no need for objects such as cows 
and molecules; all mathematical objects and relations can be formulated in 
terms of classes alone. If non-classes are required for applications to other 
sciences, then the system NBG can be modified slightly so as to apply to 
both classes and non-classes alike (see the system UR in Section 4.6 below). 

Let us introduce lower-case letters xi, x 2 , ... as special restricted vari- 
ables for sets. In other words, (Vx,-)^?(x/) stands for (VY)(M(A) =y 8ft(X)), 
that is, 8ft holds for all sets, and (3 xj)8ft{xj) stands for (3X)(M(X) A 8ft{X)), 
that is, 8ft holds for some set. As usual, the variable X used in these defi- 
nitions should be the first one that does not occur in 8ft{xj). We shall use 
x, y, z, ... to stand for arbitrary set variables. 

Example 

( VAf! ) ( Vjc) (3^) (3AT 3 ) (AT G x A y G X 3 ) stands for 
(VX l )(VX 2 )(M(X 2 ) =» (3X 4 )(M(A4) A (3Y 3 )(X, g! 2 AX 4 gI 3 ))) 


Exercise 4.2 

Prove that I - X = Y & (Vz) (z eX & z E Y). This is the so-called exten- 
sionality principle : two classes are equal when and only when they contain 
the same sets as members. 


AXIOM T 


Xi — Xi =>• {X\ G X 3 ^ Xi G X$) 


This axiom tells us that equal classes belong to the same classes. 


Exercise 

4.3 Prove that h M(Z) AZ = Y=>M(Y). 


Tf there were concrete individuals (that is, objects that are not classes), then the 
definition of equality would have to be changed, since all such individuals have the 
same members (namely, none at all). 
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NBG is a first-order theory with equality. 

Proof 

Use Proposition 4.1, axiom T, the definition of equality, and the discussion 
on page 99. 


AXIOM P (PAIRING AXIOM) 


(Vjc) (Vy) (3z) (Vm) (u e z <=> u = xV k = y) 

Thus, for any sets x and y, there is a set z that has x and y as its only 
members. 

Exercises 

4.4 Prove h (Vx)(Vy)(3ii)(Vt/)(« E z u = x\/ u = y). This asserts that 
there is a unique set z, called the unordered pair of x and j', such that z has jr 
and y as its only members. Use axiom P and the extensionality principle. 

4.5 Prove h (V^)(M(2f) (3j)(2f G y)). 

4.6 Prove I- (3Y)Pr(Y) =* -(Vr)(VZ)(3JT)(W)(t/ G Z<* U = X V U = Y). 
AXIOM N (NULL SET) 


(3x)(Vj)(v £ x ) 

Thus, there is a set that has no members. From axiom N and the ex- 
tensionality principle, there is a unique set that has no members - - that is, 
h (3jx)(Vj')(y ^ x). Therefore, we can introduce a new individual constant 0 
by means of the following condition. 


DEFINITION 

(V>’)(v^0) 

It then follows from axiom N and Exercise 4.3 that 0 is a set. 

Since we have (by Exercise 4.4) the uniqueness condition for the un- 
ordered pair, we can introduce a new function letter g{x,y) to designate the 
unordered pair of x and y. In accordance with the traditional notation, we 
shall write {x,j>| instead of g(x,y). Notice that we have to define a unique 
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value for {X, F} for any classes X and F, not only for sets x and y. We shall 
let { X , F} be 0 whenever X is not a set or F is not a set. One can prove: 
h (3,Z)([(- ,M (^) V -iM(F)) A Z = 0] V [M(JF) A M(F) A (Vm)(m e Z <* u = 
v u — F)]). This justifies the introduction of a term {X, F} satisfying the 
following condition: 

[M(Y) A M(F) A (Vu)(u 6 {X, Y} & u = X V u = F)] 

V [(-M(X) V ->M(F)) A {X, F} = 0] 

One can then prove h (Vx) (Vy) (Vm) (w € {x,t} O u = x V « = y) and 
F (VA r )(VF)M({df, F}). 


DEFINITION 


{X} for {Y,X} 


For a set x, {x} is called the singleton of x. It is a set that has x as its only 
member. 

In connection with these definitions, the reader should review Section 2.9 
and, in particular. Proposition 2.28, which assures us that the introduction 
of new individual constants and function letters, such as 0 and { X , F}, adds 
nothing essentially new to the theory NBG. 


Exercise 

4.7 (a) Prove h {X, Y} — {Y,X}. 

(b) Prove h (Vx)(V>’)({x} = {y} x = y). 


DEFINITION 

(X, Y) for {{X}, {X, r}} 

For sets x and y, (x, >’) is called the ordered pair of x and y. 

The definition of (X, Y) does not have any intrinsic intuitive meaning. It 
is just a convenient way (discovered by Kuratowski, 1921) to define ordered 
pairs so that one can prove the characteristic property of ordered pairs 
expressed in the following proposition. 


PROPOSITION 4.3 


h (Vx) (Vy) (Vw) (Vu) ( (x, y) = (a, v) => x = u A y = t?) 




Proof 
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Assume (x, y) = (u, v). Then {{x},{x,y}} = {{«}, {w, a}}. Since {x} g {{x} 
{x,t}}, M G {{«}, {«, t’}}. Hence, {x} = {w} or {x} = { u , v}. In either case’ 
x = u. Now, {u, v} G {{«}, {«, v}}; so, {«> y } C {{*}, {x,t}}- Then {«, a} - 
{x} or {u,v} = {x,y}- Similarly, {x,j} = {«} or {x,>>} = {u,v}. if 
{«, a} = {x} and {x,t} = {«}, then x=y = u = v; if not, {«, a} = {x,jd. 
Hence, {u, 0 } = {«,y}. So, if a ^ u, then y = 0 ; if a = u, then y = 0 . Thus, in 
all cases, y = v. 

Notice that the converse of Proposition 4.3 holds by virtue of the sub- 
stitutivity of equality. 


Exercise 

4.8 (a) Show that, instead of the definition of an ordered pair given in the 
text, we could have used ( X , Y) = {{0,X}, {{0}, 7}}; that is, Proposition 4.3 
would still be provable with this new meaning of ( X , Y). 

(b) Show that the ordered pair also could be defined as {{0,{X}}, 
{{7}}}. (This was the first such definition, discovered by Wiener (1914). For 
a thorough analysis of such definitions, see A. Oberschelp (1991).) 

We now extend the definition of ordered pairs to ordered n-tuples. 


DEFINITIONS 


<X)=X 

(X\, • • • jA/jjA/j-fi) \{Xi, . . . , X n ^ , X n ^. 1 ) 


Thus, (X, Y,Z) — (( X , 7},Z) and (X, Y,Z, U) = <((X, Y),Z), U ). 

It is easy to establish the following generalization of Proposition 4.3 


(V^n) . . . ( Vx n ) ( Vj ; [ ) . . . (V3/i)((xi, . . . .,X/j) (yi, * * • 

xi = y\ A... Ax„ = y „ ) 


AXIOMS OF CLASS EXISTENCE 


(Bl) (3X)(Vw)(Va)((w, a) G X & u G v) 

(B2) (VX) (V7)(3Z) (Vw) (u G Z a G X A u G Y) 

(B3) (VX)(3Z)(Vw)(w £Z^u£X) 

(B4) (VX) (3Z) (V«) (m GZo (3a)({«, d)gI)) 

(B5) (VX) (3Z) (Vm) (Va) ( (it, a) G Z & it G X) 

(B6) (VX)(3 Z)(Vu)(Vv)(Vw)((ii, v,w) eZ<& (a, vv, it) 
(B7) (VX)(3Z)(V«)(Vr)(Vw)((H, a , w) G Z o (u,w, a) 


(g -relation) 
(intersection) 
(complement) 
(domain) 

GX) 

GX) 
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From axioms (B2) (B4) and the extensionality principle, we obtain: 

(VY)(VF)(3iZ)(Vt/)(// G Z u G -A A u G F) 

F (VX)(3iZ)(V«)(i/ t/^X) 

h (VX)(3]Z)(Vt/)(// GZ^ (3 u)((i/, t>) g X)) 

These results justify the introduction of new function letters: n, ~ and Q}. 


definitions 

(Vi/)(m G Xr\Y G X An G F) (intersection of X and F) 

(V«)(w G X <*=> i/^X) (complement of X) 

(V«)(w G £^(X) (3n)((t/, r) G X)) (domain of X) 

Iuf_= XnF (union of X and F) 

V = 0 (universal classy 

X — Y ~XnY (difference of X and F) 

Exercises 

4.9 Prove: 

(a) h (Vi/)(w G XkjY u E XV u E Y) 

(b) h (Vn) (u E V) 

(c) h (Vi /) (u EX -Y <*u EX Au$Y) 

4.10 Prove: 

(a) h XnY — YnX 

(b) h IuF= FuJ 

(c) hXC F <^XnF=X 

(d) hIC F JuF = F 

(e) h (Fnf)nZ = Jn(fnZ) 

(f) h (XuF)uZ = Xu(FuZ) 

(g) hInI=F 

(h) h XuX = A 

(i) h Xn0 — 0 
0) hXu0=X 
(k) h XnV=X 

4.11 Prove the following wfs. 

(a) h (VX)(3Z)(Vi/)(Vn)((i/, v) E Z <=> (v,n) EX) [Him: Apply axioms 
(B5),(B7),(B6) and (B4) successively.] 

(b) h (VX) (3Z) (Vi/) (Vr)(Vw) ( (i/, a, w) E Z t=> (t/, w) G X) [Efiwf ; Use 
(B5) and (B7).] 

(c) h (VX) (3Z) (Vv) (Vxi ) . . . (Vx„)( (xi , . . . , x „ , r) G Z <*=> (xi , . . . , x„) E X) 
[Him: Use (B5).] 


(l) FXuF = V 

(m) HXuF = InF 

(n) hXnF = XuF 

(o) hX-X = 0_ 

(p) h V-X = X 

(q) h X - (X - Y) = XnY 

(r) hFCX=^X-F = X 

(s) hX=X 

(t) h V = 0 

(u) hXn(fuZ) — (TnF)u(XnZ) 
(V) hXu(FnZ) = (XuF)n(XuZ) 
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tit will be shown later that V is a proper class, that is, V is not a set. 



• • • 


• 4 » 


; Vnt) E Z 



AXIOMATIC SET THEORY 


(d) 

(e) 


(f) 

(g) 

(h) 
(0 


h (VX)(BZ)(Vvi ) . . . (Vv,„)(Vxi ) . . . (Vx„)((xi , . . . ,x„, v h 
yy (xj , . . . ,x„) £ X ) [Hint: Iteration of part (c).] 
h (VX)(3Z) (Vri). . .(Vty,,) (Vxi). . .(Vx„) ((x u . . . ,x n _i, v u . . . , 

(E Z < 4 - (x i , . . . ,x w ) Gl) [/fmt: For /w = 1, use (b), substituting 
(xj, . . . ,x„_i) for u and x„ for w; the general case then follows by 
iteration.] 


h (VX)(3Z) (Vx) (Viy ) . . . (Viy„) ((^ , . . . , v m , x) EZ yyx EX) [Hint: 
Use (B5) and part (a).] 

h (VX) (3Z) (Vxi) . . . (Vx„) ({xi, . . . ,x„) EZ & (3_f) ((xi, . . . ,x„,j;) 
E X)) [Hint: In (B4), substitute (x„ , . . . ,x„) for u and y for v.] 
h (VX)(3Z)(Vw)(Vr)(Vw)((iy u, w)E Z yy (u, w) E X) [Hint: Substi- 
tute ( u , w) for u in (B5) and apply (B6).] 

h (VX) (3 Z) (Vri) . . . (Vv/t) (Vu)(Vw)((v i, . . . , Vk, u, w) £ Z yy (u,w) 
E X) [Hint: Substitute (ty , . . . , iy) for v in part (h).] 


Now we can derive a general class existence theorem. By a predicative wf 
we mean a wf <p(X \, . . . ,X,„ F 1} . . . , Y m ) whose variables occur among 
X 1} . . . ,X„, Yi , . . . , Y tn and in which only set variables are quantified (i.e., q> 
can be abbreviated in such a way that only set variables are quantified). 


Examples 

(3x'i)(xi E li) is predicative, whereas (3Fj)(xi £ hi) is not predicative. 


PROPOSITION 4.4 (CLASS EXISTENCE THEOREM) 

Let <p(X i , . . . ,X H , Y\ , . . . , Y m ) be a predicative wf. Then 

"*A 

h (3Z)(Vxi) . . . (Vx„)((xi, . . . ,x„) EZ yy <p(x l5 . . . ,x„, Fi, . . . , Y„)). 


Proof 

We shall consider only wfs <p in which no wf of the form Y f E W occurs, since 
Yi E W can be replaced by (3x)(x = Y,- Ax £ W), which is equivalent to (3x) 
[(Vz)(z Ex yy z e Yi) Ax E W]. Moreover, we may assume that tp contains 
no wf of the formX E X, since this may be replaced by (3 u){u = X A u eX), 
which is equivalent to (3w)[(Vz)(z EuyyzEX)AuE X]. We shall proceed 
now by induction on the number k of connectives and quantifiers in <p 
(written with restricted set variables). 

Base: k = 0. Then <p has the form xy E x t - or x f E X/ or x,- £ Yg, where 
1 ^ / < j^n. For X/ £ xj, axiom (Bl) guarantees that there is some W\ such 
that (Vx/) (Vxy ) ( (xj , Xy ) E W\ yy x,- E xj). For xj e x,-, axiom (Bl) implies that 
there is some Wi such that (Vx,)(Vxy)((x/,xy) E Wi Xj G Xj) and then, by 
Exercise 4.11(a), there is some W$ such that (Vx,)(xy)((x/,xy) E W$ yy xy E xy). 
So, in both cases, there is some W such that (Vxy)(Vxy)((xy,xy) E W yy 
(p(x \, . . . ,x„, Fi, . . . , Y m )). Then, by Exercise 4. 1 l(i) with W —X, there is 
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some Z\ such that (Vxi) . . . (Vx,_i)(Vx,)(Vxy)((xi, . . . ,x/_i,x,-,xy) G Z\^ 
0 (jq , . . • , x„, Y \ , . . . , Y m )). Hence, by Exercise 4.1 1(e) with Z\ — X, there exists 
% 2 such that (Vxi) . . . (Vx,) (Vx, + i) . . . (Vxy)((x,, . . . ,xj) G Z\ o <p(x t , . . . , x„ , 
f j , . . • , F,„ ) ) • Then, by Exercise 4.11(d) with Z 2 = X, there exists Z such that 
(Vxi) . • . (Vx„)((xi ,...,x„)eZ^ <p(*i , . . . , x,„ Yi , . . . , i;,)). In the remaining 
case, x,- g Y(, the theorem follows by application of Exercise 4.11(f,d). 

Induction step. Assume the theorem provable for all k <r and assume 
that q> has r connectives and quantifiers. 

(a) (p is - 1 ^. By inductive hypothesis, there is some W such that 
(Vxi ) • • • (Vx„) ( (xj , . . . , x n ) G IV ij/ (xj , . . . , x n ; I) , . . . , Y nj ) ) . Let Z = IV . 

(b) (p is =>- D. By inductive hypothesis, there are classes Z\ and Z 2 such 
that (Vx ( ). . .(Vx„)((xi,. . .,x„) eZ x <J=^(xi, . . . ,x„, Yi , . . . , Y; n )) and_ (Vxi) 
. . . (Vx /; )((xi , • • • , x w ) G Z 2 i9(xj j . . . , X//) Y \ , . . . , Y t) 1 )). Let Z Z\ c\Z 2 . 

(c) <p is (Vx)^. By inductive hypothesis, there is some W such that 

(Vxi ) . . .(Vx„) (Vx) ((xi , . . . , x„ , x) e_W <=> \}/(xi . . , x„ , x, Y \ , . . . , F„,)) . Apply 

Exercise 4.11(g) with X —W to obtain a class Z\ such that 
(Vxi)^ .(Vx„)((xi , . . . ,x„) G Zi <£> (3x)-u/^(xi , . . . , x„, x, Yy , . . . , F w )) Now let 
Z—Z\, noting that (Vx)i// is equivalent to -i(3x)-ii/o 

Examples 

1. Let (p{X , Fi, F 2 ) be (3n)(3t;)(X = (w, v) A u G Y\ A v G I 2 )• The only 
quantifiers in <p involve set variables. Hence, by the class existence the- 
orem, h (3Z)(Vx)(x G Z (3n)(3r>)(x = (m, c) AmG Y x Are F 2 )). By the 
extensionality principle, 

h (3iZ)(Vx)(x G Z<^ (3w)(3f)(x = ( u , v) A u G Fi A u G F>)- 
So, we can introduce a new function letter x . 

DEFINITION 

{Cartesian product of F ( and I 2 ) 

(Vx)(x G Yy x F> ^ (3w)(3f)(x =(m,d) A«Gh AdG F 2 )) 

DEFINITIONS 

F 2 for F x F 

F" for Y" ] x F when /? > 2 

Rel(Y) for X C V 2 {X is a relation ) + 

V 2 is the class of all ordered pairs, and V" is the class of all ordered 
tuples. In ordinary language, the word ‘relation’ indicates some kind of 
connection between objects. For example, the parenthood relation holds 


t More precisely, Rel(Y) means that X is a binary relation. 
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between parents and their children. For our purposes, we interpret the 
parenthood relation to be the class of all ordered pairs ( u , v) such that u is § 
parent of v. 


2. Let (p{X , F) be X C F. By the class existence theorem and the cx- 
tensionality principle, h (3|Z)(Vx)(x gZojC F). Thus, there is a un- 
ique class Z that has as its members all subsets of F. Z is called the power 
class of F and is denoted £P(Y). 


DEFINITION 

(Vx)(xe 0>(Y) <^xc f) 

3. Let <p(X, F) be (3r)(A e vAv e F)). By the class existence theorem and 
the extensionality principle, h (3i Z)(Vx)(xeZ<^ (3n)(* E V A V £ 
Thus, there is a unique class Z that contains all members of members of 
F. Z is called the sum class of F and is denoted (J F. 


DEFINITION 

(Vx)(x E (J F <*=> (3r)(x E v A v E Y)) 

4. Let (p(X) be (3 u)(X — (m,m)). By the class existence theorem and the 
extensionality principle, there is a unique class Z such that (Vx) (x e Z 
(3m) (x = (m, «))). Z is called the identity relation and is denoted I. 

DEFINITION 

(Vx)(x El ^ (3w) (x = (m, m))) 


COROLLARY 4.5 

H >(*„..• , X„, Fi, . . . , Y m ) is a predicative wf, then 

I- (3i W){W C V” A (Vxi ) . . . (Vx„)((x l5 . . . ,x„) E W & (p{x u ...,x„,Y u . 



Proof 


By Proposition 4.4, there is some Z such that (Vxi) . . . (Vx„)((xi, . . . ,x„) 
E Z <$=> <p(xi, . . . ,x„, Fi, . . . , Y m )). Then W = ZnV" satisfies the corollary, 
and the uniqueness follows from the extensionality principle. 


DEFINITION 

Given a predicative wf <p(X l} . . . ,X„, F|, . . . , Y m ), let 

{(x] , . . . ,x„)|f/)(xi , . . . , x„, Y \ , . . . , F„,)} denote the class of all / 7 -tuples 
(xi , . . . ,x„) that satisfy <p(xi , . . . ,x„, Y \ , . . . , Y m \, that is, 


» » • 
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(Vw) (m £ { (Aj , . . . , Xfj ) | (p (Xi > . . • , X n j j . . . , Fji ) } 

(3xi) . . . (3X;i)(m — (a'j , . . . ,X n ) A <p(Xi, . . . ,Xji, Y\ , . . . , Fn))) 

This definition is justified by Corollary 4.5. In particular, when n = 1, 
ff— ^jii)(u £ Fj , . . . , 1/n)} h| , . . . , 

* Examples 

1. Take <p to be (x 2 ,*i) € F. Let Y be an abbreviation for 

* 2 ) I (* 2 ^* 1 ) £ F}. Hence, F C V 2 A ( V?d ) (Vjc 2 ) ( {-^ 1 , -^ 2 ) 

G Y). Call F the inverse relation of Y. 

2. Take q> to be (3r)((r,x) G F). Let £%(F) stand for {jc| (3t>) (( 1 ;, jc) G F)}. 
Then h (Vu)(u € 8%{Y) ^ (3v)((i;,^) G F)). M(Y) is called the range of F. 
Clearly, h ^(F) = ^(F). 

Notice that axioms (B1)-(B7) are special cases of the class existence 
theorem, Proposition 4.4. Thus, instead of the infinite number of instances 
of the axiom schema in Proposition 4.4, it sufficed to assume only a finite 
number of instances of that schema. 


Exercises 


4.12 Prove: 

(a) HJ0 = 0 

(b) h U(0} = 0 

(c) \~\JV=V 

(d) h&(V) = V 

(e) hICf=> \JX C (J F A 0>(X) C 0>{Y) 

(f) h =X 

(g) hXC^fljY) 

(h) h (XnY) x ( WnZ ) = (Ix JV)n(Y x Z) 

(i) h (JuF) x (VuZ) = (X x W)kj(X x Z)u(F x W)n(Y x Z) 

(j) h ^(XnY) = ^(X)n^(Y) 

(k) h 3P{X)vj0>{Y) C &{X\jY) 

(l) What simple condition on X and F is equivalent to 
^(XuF) C ^(X)u^(F)? 

(m) i-U(Auy) = (Un^(UA 

(n) HU(AnF)C(UX)n((Jr) 

(o) h z=Y =t-Z = YnV 2 

(p) I- Rel(7) A 1 = 1 

(q) h fl»(0) = {0} 

(r) h ^({0}) = {0, {0}} 

(s) h (Vx)(Vv)(x xy C ^(^(x\jy))) 

(t) h Rel(F) => Y C 9(Y) x 99(Y) 
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Until now, although we can prove, using Proposition 4.4, the existence of 
a great many classes, the existence of only a few sets, such as 0, {0}, {0, {0jT 
and {{ 0 }}, is known to us. To guarantee the existence of sets of greater 
complexity, we require more axioms. 


AXIOM U (SUM SET) 


(Vx)(3y)(Vi/)(w Cyft (3y)(« G v A v G x)) 

This axiom asserts that the sum class (Jx of a set x is also a set, which we 
shall call the sum set of x, that is, I- (Vx)M((Jx). The sum set (Jx is usually 
referred to as the union of all the sets in the set x and is often denoted II i, 


Exercises 

4.13 Prove: 

(a) h (Vx)(Vy)(|J{x,T} = 

(b) H (Vx) (Vy) M (xuy) 

(c) h (Vx) (U{x> = x) 

(d) h (Vx) (Vy) (|J (x, y) = {x,^}) 

4.14 Define by induction (xj, . . . ,x„} to be {x 1( . . . ,x„„i}u{x„}. Prove 

I- (Vx ( ) . . . (Vx„)(Vm)(m g (x[, . . . , x„ } u —X\ V...V« = Xf t ) Thus, for any 
sets Xi , . . , ,x„, there is a set that has x\ , . . . ,x w as its only members. 

Another means of generating new sets from old is the formation of the set 
of subsets of a given set. 


AXIOM W (POWER SET) 

(Vx) (3y) (Vw) (uGj^u Cx) 

This axiom asserts that the power class 8P(x) of a set x is itself a set, that is, 
h (Vx)M(^(x)). 

A much more general way to produce sets is the following axiom of 
subsets. 


AXIOM S (SUBSETS) 


(Vx)(VT)(3z)(Vm)(m gz^hGxAmG T) 


COROLLARY 4.6 
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(a) I - (V*)(VF)M(jcnF) (The intersection of a set and a class is a set.) 

(b) F (Vjc)(VF)(F C x =4- M(F)) (A subclass of a set is a set.) 

(c) For any predicative wf 3ft(y), h (Vx)M({y[y E * A 3ft (y)}). 

Proof 

(a) By axiom S, there is a set z such that (Vw)(m Ez<=>wExAmE F), 
which implies (Vn)(w ez^-«G xnF). Thus, z — xnY and, therefore, 
rnf is a set. 

<b) if r C x, then rnf = Y and the result follows by part (a). 

(c) Let Y = {y\y E x A 3ft (y)}. Since 7 Cr, part (b) implies that Y is a set. 

Exercise 
4.15 Prove: 

(a) h (V*)(M(0(*)) A M{@{x))). 

(b) I- (Vx)(Vy)M(x x y). [Hint: Exercise 4.12(s).] 

(c) b M (0(F)) A M(0(F)) A Rel(F) M(F). [ Hint: Exercise 4.12(t).] 

(d) F Pr( F) A F C X =» Pr(A r ). 

On the basis of axiom S, we can show that the intersection of any non- 
empty class of sets is a set. 


DEFINITION 

P)A for {y| (Vx)(x E X =4- y 6 x)} (intersection) 


PROPOSITION 4.7 

(a) F (Vx) (x f)* C *) 

(b) hX^0=^M(D^) 

(c) Ff)0=L 

Proof 

(a) Assume u E X. Consider any y in f]X. Then (Vx) (xg! =>- p e x) . 
Hence, y E u. Thus, C u. 

(b) Assume X ^ 0. Let x eX. By part (a), p| Y Ci Hence, by Corollary 
4.6(b), is a set. 

^More precisely, the wf Y E X A 3ft(Y) is predicative, so that the class existence 
theorem yields a class {y\y G X A 3ft (y)}. In our case, X is a set x. 
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(c) Since h (Vjc) (x £ 0), I- (Vy) (Vx) (x G x), from which we obtain 

h (V y)(y G f)0). From h (Vy) (y G V) and the extensionality principle 

hf|0 = P. 


Exercise 

4.16 Prove: 

(a) hClM^xny 

(b) h f|{x} = x 

(c) h/C fl fc f]X 

A stronger axiom than axiom S will be necessary for the full development 
of set theory. First, a few definitions are convenient. 


DEFINITIONS 

Fnc(X) for Rel(X) A (Vx)(Vy)(Vz)({x,y) G X A {x, z) G X => y = z) 

(X is a function ) 

X : Y — * Z for Fnc(X) A &{X) = Y A 8%{X) C Z {X is a function from Y into Z) 
Y[X for Xr\(Y x V) {restriction ofX to the domain Y) 

Fnc[(Y) for Fnc(Af) A Fnc(Y) (X is a one — one function) 

X , Y= \ Z if (Vu)((Y,u)gX u = z) 

1 0 otherwise 
X“Y = &(YIX) 

If there is a unique z such that (y, z) G X, then z = X*y; otherwise, 
X l y = 0. If X is a function and y is a set in its domain, X ( y is the value of the 
function applied to y. If X is a function, X U Y is the range of X restricted to 

y.t 


Exercise 


4.17 Prove: 

(a) h Fnc(X) Ay G &{X) => (Vz)(2f‘y = zo (y,z) eX) 

(b) h Fnc(X) AY c 3>{X) =y Fnc(FfX) A 9{Y[X) ~Y A (Vy) (y G 
X‘y=(Y(X)‘y) 

(c) h Fnc(X) =*- [Fnc i (X ) « (Vj)(Vz)(y 6 &(X) A z e ®(X)/\y^z=y 

X'y^X'z)} 

(d) h Fnc(Jf) AFC S(X) = 5 . (Vz)(z eX“Y (3y)(y e Y AX‘y = z)) 


*In traditional set-theoretic notation, iff 7 is a function and y is in its domain, Fy 
is written as F(y), and if Y is included in the domain of F, F U Y is sometimes written 
as F[F]. 
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AXIOM R (REPLACEMENT) 


Fnc(F) => (Vx) (3y) (V m) (m Gy ■<=> (3u)((i;, u) G F AdGi)) 

Axiom R asserts that, if F is a function and x is a set, then the class of second 
components of ordered pairs in F whose first components are in x is a set 
(or, equivalently, 3%{x[Y) is a set). 


Exercises 

4.18 Show that, in the presence of the other axioms, the replacement axiom 
(R) implies the axiom of subsets (S). 

4.19 Prove I- Fnc(F) => (Vx)M(F“jc)). 

4.20 Show that axiom R is equivalent to the wf 
Fnc(F) A M (0(F)) => M (02(F)). 

4.21 Show that, in the presence of all axioms except R and S, axiom R is 
equivalent to the conjunction of axiom S and the wf 

Fncj(F) A M(S(F)) =* M(5?(F)). 

To ensure the existence of an infinite set, we add the following axiom. 


AXIOM I (AXIOM OF INFINITY) 


(3x)(0 G x A (Vw)(w Cx=> wu{«} G x)) 

Axiom I states that there is a set x that contains 0 and such that, whenever a 
set u belongs to x, then u u {«} also belongs to x. Hence, for such a set x, 
{0} G x, (0, {0}} G x, (0, {0}, (0, {0}}} G x, and so on. If we let 1 stand for 
{0}, 2 for (0, 1}, 3 for {0, 1,2 }, ... for (0, 1 , 2 , ... ,n — 1}, etc., then, for 
all ordinary integers n >0, n ex, and 0^1, 0^2, 1^2, 0^3, 1^3, 
2 ^ 3 , 


Exercise 

4.22 (a) Prove that any wf that implies (ELF)M(X) would, together with 

axiom S, imply axiom N. 

(b) Show that axiom I is equivalent to the following sentence (/*): 
(3x)((3y)(y G x A (Vw)(w >’)) A (Vw)(w Gx=> wu{w} G x)) 

Then prove that (/*) implies axiom N. (Hence, if we assumed (/*) instead of 
(I), axiom N would become superfluous.) 
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This completes the list of axioms of NBG, and we see that NBG has only a 
finite number of axioms - namely, axiom T, axiom P (pairing), axiom N 
(null set), axiom U (sum set), axiom W (power set), axiom S (subsets), axiom 
R (replacement), axiom 1 (infinity), and the seven class existence axioms 
(B1)-(B7). We have also seen that axiom S is provable from the other 
axioms; it has been included here because it is of interest in the study of 
certain weaker subtheories of NBG. 

Let us verify now that the usual argument for Russell's paradox does not 
hold in NBG. By the class existence theorem, there is a class Y — {jc|jc £ x\. 
Then (Vjc) (. r E l 7 <=) x<£x). In unabbreviated notation this becomes 
(VY)(M(W) =>• (X e Y <&X£X)). Assume M(F). Then Y e Y Y<£Y y 
which, by the tautology (A <=> ->A) => (A A ~>A), yields Y € Y A Y £ Y. Hence, 
by the derived rule of proof by contradiction, we obtain I — iM ( F) . Thus, in 
NBG, the argument for Russell’s paradox merely shows that Russell’s class 
Y is a proper class, not a set. NBG will avoid the paradoxes of Cantor and 
Burali-Forti in a similar way. 


Exercise 

4.23 Prove I — <M(L), that is, the universal class V is not a set. [Hint: Apply 
Corollary 4.6(b) with Russell’s class F.] 


4.2 ORDINAL NUMBERS 

Let us first define some familiar notions concerning relations. 


DEFINITIONS 

X Irr Y for Rcl(Y) A (Vy)(y e Y => (y,y) £X) 

(X is an ir reflexive relation on F) 

X Tr F for Rel(Y) A (V«)(Vy)(V>v)([w eY AveY AwEYA 

(u, v) E X A ( v , vv) (u, >v) E X) 

(X is a transitive relation on F) 

X Part F for (X Irr F) A (T Tr F) (X partially orders Y) 

X Con F for Rel(Y) A (Vh)(Vi>)([h EY AveY Au^v]^ 

(u, v) EX V ( v , u) E X) 

( X is a connected relation on F) 

X Tot F for (X Irr F) A (X Tr F A {X Con F) ( X totally orders Y) 

X We F for (X Irr F) A (VZ)([Z C F A Z # 0] =* (3y)(y eZ A 

{Vv){v EZ Av^y=> (y,v) EX A (v,y) £X))) 
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(X well-orders Y, that is, the relation X is irreflexive on Y and every non- 
empty subclass of Y has a least element with respect to X) 


Exercises 

4.2 4 Prove h X We Y =y X Tot Y. [ Hint : To show X Con Y, let x E Y A y 
E Y A x 7 ^ y. Then {x,y} has a least element, say x. Then (jc,^) e X. To show 
X Tr Y, assume x € Y A y € Y A z e Y A (x,y) e XA (y, z) € X. Then {x,p, z} 
has a least element, which must be x.] 

4.25 Prove YX We Y AZ C Y =^X We Z. 

Examples (from intuitive set theory) 

1 . The relation < on the set P of positive integers well-orders P. 

2. The relation < on the set of all integers totally orders, but does not well- 
order, this set. The set has no least element. 

3. The relation C on the set W of all subsets of the set of integers partially 
orders W but does not totally order W. For example, {1} ^ {2} and 

{ 2 } t {!}• 


DEFINITION 

Simp(Z, Wi, Wi) for 

(3x 1 )(3x2)(3n)(3r 2 )(Rel(n) A Rel(^) AW\ — <ri,xi) A W 2 = (r 2 ,x 2 ) 

A Fnci(Z) A 9[2) ~ Xi A 8%{Z) = x 2 A (Vw)(Vt')(« e xi A v E x\ => 

((«,y) G n (Z‘w,Z‘y) G /*2))) 

(Z is a similarity mapping of the relation r\ on xi onto the relation t'l on x 2 . ) 


DEFINITION 


Example 


Sim(tfi, W 2 ) for (3z)Simp(z, Wi , W 2 ) 

(W] and W 2 are similar ordered structures) 


Let i‘i be the less-than relation < on the set A of non-negative integers 
(0, 1, 2, . . and let r 2 be the less-than relation < on the set B of positive 
integers { 1, 2, 3, . . Let z be the set of all ordered pairs (x,jc + 1) for x E A. 
Then z is a similarity mapping of {r\,A) onto ( r 2 ,B ). 


DEFINITION 

X\ oX 2 for {(u,v)\(5z)((u,z) eX 2 A{z,v) GZi)} 

(the composition of X 2 and X\ ) 
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Exercises 

4.26 Prove: 

(a) h Simp(Z,X, Y) => M(Z)AM(I) A M(7) 

(b) b Simp (Z, X, Y) Simp(Z, Y,X) 

4 . 27 

(a) Prove: b Rel(Xj) A Rel(X 2 ) => Rel(Xj oI 2 ) 

(b) Let X\ and X 2 be the parent and brother relations on the set of human 
beings. What are the relations X\ o X\ and X\ o X 2 ? 

(c) Prove: b F nc {X\ ) A Fnc(X 2 ) =4> Fnc(X| 0 X 2 ) 

(d) Prove: b FnC[ (Xi) A FtiC| (X 2 ) Fnc] (X\ 0 X 2 ) 

(e) Prove: b (X l : Z -> W /\X 2 : Y -► Z) =*► Xj oX 2 : Y -> W 

DEFINITIONS 

Fld(X) for @{X)um(X) (the field of X) 

TOR(X) for Rel(X) A (X Tot (Fld(X))) {X is a total order ) 

WOR(X) for Rel(X) A {X We (Fld(X))) (X is a well — ordering relation ) 

Exercise 

4. 28 Prove: 

(a) b Sim (Hi, W 2 ) => Sim( W 2 ,Wi) 

(b) b Sim( W\ , W 2 ) A Sim(fb 2 , W 3 ) => Sim(ffi , W 3 ) 

(c) b Sim( (X, Fld(X) ) , ( Y, FId( Y) ) ) =» (TOR(X) 4 * TOR( 7) ) A ( WOR(X) 

« ^WOR(T)) 

If x is a total order, then the class of all total orders similar to x is called 
the order type of x. We are especially interested in the order types of well- 
ordering relations, but, since it turns out that all order types are proper 
classes (except the order type {0} of 0), it will be convenient to find a class W 
of well-ordered structures such that every well-ordering is similar to a un- 
ique member of W. This leads us to the study of ordinal numbers. 


DEFINITIONS 

E for {(*,>’) I* £ y} (the membership relation) 

Trans(X) for (\/u)(u £ X =y u C X) (X is transitive) 

Sect Y (X, Z) for 

z c x a (Vk)(Vi?)([k e x a v e z a (u, v) e Y]=>ueZ) 

( Z is a Y-section of X, that is, Z is a subclass of X and every 
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member of X that F-precedes a member of Z is also a 
member of Z.) 

Seg y(X IF ) for {.v \x gIa (x, W) £ F}(the Y-segment of X determined 
by W y that is, the class of all members of X that F-precede W) 


Exercises 

4.29 Prove: 

(a) l - Trans (2f) (Vw)(Vr)(r £ u Am £ X =»► v £ X s ) 

(b) b Trans(X) (JX C X 

(c) I- Trans (0) 

(d) b Trans({0}) 

(e) b Trans (X) A Trans(F) =4- Trans(XuF) A Trans(XnF) 

(f) b Trans (X) =4- Trans ((JX) 

(g) b (Vw)(m £ X) =4- Trans(w)) =>• Trans((JX) 

4.30 Prove: 

(a) b (Vw) [Seg E (X, u) = Xr\u A M(Seg E (X, i/))l 

(b) b Trans(X) (Vw)(m £ X =* Seg E (X, u) = u) 

(c) b E We X A Sect E {X, Z)AZ/I^ (3 u)(u £ X A Z = Seg E (X, iv)) 


DEFINITIONS 

Ord(X) for E We X A Trans (X) (X is an ordinal class if and 

only if the £ -relation well-orders X and any member 
of X is a subset of X) 

On for {.r|Ord(v)} (The class of ordinal numbers) 


Thus, b (Vx) (x £ On <^=> Ord(x)). An ordinal class that is a set is called an 
ordinal number, and On is the class of all ordinal numbers. Notice that a wf 
x £ On is equivalent to a predicative wf — namely, the conjunction of the 
following wfs: 


(a) (Vn) (w £ x =>• u £ u) 

(b) (\/u)(u CxAu/0=^ (3n)(r € u A (Vvv)(w £ m Aw^ r 

(c) (Vw) (u £ x =£► u C x) 


i)£W>A 


(The conjunction of (a) and (b) is equivalent to E We x, and (c) is Trans(x).) 
In addition, any wf On £ F can be replaced by the wf (3 \y)(y £ FA 
(Vz) (z £ y z £ On)). Hence, any wf that is predicative except for the 
presence of ‘On’ is equivalent to a predicative wf and therefore can be used 
in connection with the class existence theorem. 
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4.31 Prove: (a) I- 0 G On. (b) K 1 g On, where 1 stands for {0}. 

We shall use lower-case Greek letters a, /?, y, <5, t, . . . as restricted variables 
for ordinal numbers. Thus, (Va)^(a) stands for (Vjc) (x e On ^ &{x)), and 
(3a) 0(a) stands for (3x)(jc g On A 0(x)). 


PROPOSITION 4.8 

(a) h Ord(X) =>(X<£X/\ (Vm)(«gI=^ u<£ u)) 

(b) h Ord(X) AfClA Trans(F) F G X 

(c) h Ord(X) A Ord(F) (F C X <=> Y G X) 

(d) h Ord(X) A Ord(F) [(X G F VX = F V F G X) A -i(X G F A Y e X) 

A n(XG F Vl = F)] 

(e) \-Ord(X) AY eX =>Y € On 

(f) h E We On 

( g ) I- Ord(Ow) 

(h) h-nM(0«) 

(i) h Ord(X) X = On VX G On 

(j) h y C On A Trans(j) ^ y E On 

(k) hx€OnA}'£0/i=>(xCjV}'Cjr) 

Proof 

(a) If Ord(X), then E is irreflexive on X; so, (Vu)(u e X ufiu); and, if 
X g X, X i X. Hence, X $ X. 

(b) Assume Ord(X) aFcIA Trans(F). It is easy to see that F is a proper 
E-section of X. Hence, by Exercise 4.30(b,c), Y eX. 

(c) Assume Ord(X) A Ord(F) . If F G X, then F C X, since X is transitive; 
but F ^ X by (a); so, F Cl. Conversely, if F c X, then, since F is 
transitive, we have F G X by (b). 

(d) Assume Ord(X) AOrd(F) AX ^ F. Now, XnY CX and XnY C Y. 
Since X and F are transitive, so is XnY. If XnY cl and TnF c F, 
then, by (b), XnY eX and XnY e F; hence, XnY eXnY, contra- 
dicting the irreflexivity of E on X. Hence, either XnY —X or 
XnY = F; that is, X C F or F C X. ButX ^ F. Hence, by (c), X g For 
Y eX. Also, if X g F and F G X, then, by (c), X c F and F c X, which 
is impossible. Clearly, XgFaX=Fis impossible, by (a). 

(e) Assume Ord(X) A F G X. We must show E We F and Trans(F). Since 
F G X and Trans (X), F C X. Hence, since E WeX, E We Y. Moreover, 
if u e Y and v G u, then, by Trans(X), v eX. Since E Con X and 
F G X A v G X, then veYVv — YVYev. If either v = Y or Y ev. 
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then, since E Tr X and u £ Y A v £ u, we would have u £ u, contra- 
dicting (a). Hence v £Y. So, if u £ Y, then u C Y, that is, Trans(F). 

(f) By (a), E Irr On. Now assume X C On /\X ^0. Let a £ X. If a is the 
least element of X, we are done. (By least element of X we mean an 
element v in X such that (Vw)(w £X Ai i^v=yv£ u ).) If not, then E 
We a and /na^0; let f3 be the least element ofXna. It is obvious, 
using (d), that (3 is the least element of X. 

(g) We must show E We On and Trans(On). The first part is (f). For the 
second, if u £ On and v £ u, then, by (e), v £ On. Hence, Trans(On). 

(h) If M (On), then, by (g). On £ On, contradicting (a). 

(i) Assume Ord(Xf). Then X C On. If X ^ On, then, by (c), X £ On. 

(j) Substitute On for X and y for F in (b). By (h), y c On. 

(k) Use parts (d) and (c). 

We see from Proposition 4.8(i) that the only ordinal class that is not an 

ordinal number is the class On itself. 


DEFINITIONS 

x < 0 y for x £ On Ay £ On Ax £ y 
x<o y for y £ On A (x = y V x < 0 y) 

Thus, for ordinals, < 0 is the same as £\ so, < 0 well-orders On. In particular, 
from Proposition 4.8(e) we see that any ordinal x is equal to the set of 
smaller ordinals. 


PROPOSITION 4.9 (TRANSFINITE INDUCTION) 

I- (Vj8)[(Va)(a G jB => a eX) => (5 eX] On CX 

(If, for every fi, whenever all ordinals less than (i are in X , must also be in 
X, then all ordinals are in X.) 


Proof 

Assume (V/?)[(Va)(a £j5^a£l) =$■ (3 £X]. Assume there is an ordinal in 
On — X. Then, since On is well-ordered by E, there is a least ordinal (3 in 
On — X. Hence, all ordinals less than (3 are inX. So, by hypothesis, [3 is in X, 
which is a contradiction. 

Proposition 4.9 is used to prove that all ordinals have a given property 
t%(a). We let X = (x|^(x) Ax £ On} and show that (V/?)[(Va)(a £ (3 =4- 

#(«)) =* &(p)]. 



DEFINITION 
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x' for xu{x} 

PROPOSITION 4.10 

(a) h (Vx)(x £ On x' € On) 

(b) h (Va)--(3/?)(a < 0 P <o «') 

(c) h (Va)(V/?)(a' = ft ^ a — ft) 


Proof 

(a) x £ x'. Hence, if x! E On, thenx £ On by Proposition 4.8(e). Conversely 
assume x E On. We must prove E We (xw{x}) and Trans(xw{x}). Since 
E We x and x ^ x, E Irr (xw{x|). Also, if y ^ 0 A y C xu{x}, then either 
y = |x}, in which case the least element of y is x, or^nx/0 and the 
least element of yr\x is then the least element of y. Hence, E We 
(xu{x}). In addition, if >>£xu{x} and uEy, then uEx. Thus, 
Trans(xu{x}). 

(b) Assume a < G P < 0 a'. Then, a E ft A ft E o'. Since a E ft, P£ a and ft ^ a 
by Proposition 4.8(d), contradicting ft E a'. 

(c) Assume a' = 0 . Then ft < G af and, by part (b), /?< G a. 

Similarly, cc^ 0 p. Hence, a — p. 

Exercise 

4.32 Prove: h (Va)(a C a') 

DEFINITIONS 

Suc(Y) for X G On A (3a) (X — a') (X is a successor ordinal ) 

K| for {x|x = 0 V Suc(x)} (the class of ordinals of the first kind) 

co for {x|x G Ki A Vm)(h Gx=>wGKi)} (co is the class of all ordinals a of the first 

kind such that all ordinals smaller 
than a are also of the first kind) 

Example 

P 0 E co A 1 £ co. (Recall that 1 = {0}.) 

PROPOSITION 4.11 

(a) P(Va)(a £ co a' E co) 

(b) PM(a>) 





/ c ) b 0 £ X A (Vk)(m Gl=^i/€l)^C!)CI 

(d) I" (Va)(a £coA/?< 0 a=^/?£co) 


Proof 

| a ) Assume a £ co. Since Suc(a'), d £ K| . Also, if (3 £ d, then (3 £ a or 
ft = a. Hence, (3 £ Ki . Thus, a' £ co. Conversely, if a' € co, then, since 
a £ ci' and ( y(3)(f3 £ a => [3 £ d), it follows that a £ co. 

Tb) By the axiom of infinity (I), there is a set x such that 0 £ x and 
(\/u)(u E x=$- d Ex). We shall prove co C x. Assume not. Let a be the 
least ordinal in co — x. Clearly, a fi 0, since 0 £ x. Hence, Suc(a). So, 

(3 0){a = 0). Let S be an ordinal such that a = S' . Then S < 0 a and, by 
part (a), S £ co. Therefore, 6 £ x. Hence, S' £ x. But a = S' . Therefore, 
a £ x, which yields a contradiction. Thus, co C x. So, M(co) by Corol- 
lary 4.6(b). 

(c) This is proved by a procedure similar to that used for part (b). 

(d) This is left as an exercise. 

The elements of co are called finite ordinals. We shall use the standard 
notation: 1 for 0', 2 for 1', 3 for 2', and so on. Thus, 

0 G co, 1 G co, 2 G co, 3 £ co, .... 

The non-zero ordinals that are not successor ordinals are called 
limit ordinals. 

DEFINITION 

Lim(x) for x £ On A x £ Kj 

Exercise 

4.33 Prove: 

(a) b Lim (co) 

(b) b (Va)(V/?)(Lim(a) A (3 < G a =b 0 < 0 a). 

PROPOSITION 4.12 

(a) b (Vx)(x C On =^[(Jx £ On A (Va)(a G x => a< 0 (J x ) A (V/?)((Va) 

(a £ x ^ ot^ 0 (3) => (Jx^ 0 «])• (if* is a set of ordinals, then (Jx is an 
ordinal that is the least upper bound of x.) 

(b) b (Vx)(x C On A x 0 A (Va)(a £ x => (3 0)(f3 £ x A a < 0 (3))\ 

=b Lim((Jx)). (If x is a non-empty set of ordinals without a maximum, , 
then (J x is a limit ordinal.) 




Proof 
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(a) Assume x C On. (Jx, as a set of ordinals, is well-ordered by E. Also jr 

a e\Jx A (3 E a, then there is some y with y E x and a ey. Tlfen 
(3 E a A a € y; since every ordinal is transitive, (3 e y. So, ft £ 
Hence, (Jjc is transitive and, therefore, (Jx E On. In addition, if oc e V 
then a C (Jjc; so, a^ 0 (Jx, by Proposition 4.8(c). Assume now that 
(Va)(a a< 0 /?)- Clearly, if 3 E (Jjc, then there is some y such that 

3 e y A y E x. Hence, y< 0 /5 and so, 5 < 0 ft. Therefore, (Jjc C (3 and, by 
Proposition 4.8(c), (Jx< 0 /?. 

(b) Assume jc C 0/?Ax^0A(Va)(a Gx=> (3 (3){(3 E x A a < 0 (3)). If (J x _ ^ 
then a Ex implies a = 0 . So, x = 0 or x = 1 , which contradicts our? 
assumption. Hence, (Jx ^ 0 . Assume Suc((Jx). Then (Jx = y' for sonir 
y. By part (a), (Jx is a least upper bound of x. Therefore, y is not aiT 
upper bound of x; there is some 3 in x with y < 0 S. But then 3 = (u 
since (Jx is an upper bound of x. Thus, (Jx is a maximum element of* * 
contradicting our hypothesis. Hence, -iSuc((Jx), and Lim((Jx) is the 
only possibility left. 


Exercise 
4.34 Prove: 

(a) I- (Va)([Suc(a) =4 ((J a)' — a] A [Lim(a) =4 (J a — a]). 

(b) If 0 7^ x C On, then (J x is the least ordinal in x. 

We can now state and prove another form of transfinite induction. 


PROPOSITION 4.13 (TRANSFINITE INDUCTION: SECOND FORM) 

(a) h[0eJA (Va)(a EX) a (Va)(Lim(a) A (V(3)((3 < Q a=^/? eX) 

=^ael)]^ On C X. 

(b) ( Induction up to (5.) h [0 E X A (Va)(a < D 3 A a EX 
=> a 1 E X) A (Va)(a < c 3 A Lim(a) A (V/?)(/? < c a 
=4- P E X) =4 a E X)\ =4 3 C X. 

(c) ( Induction up toco.) I-0E X A (Va) (a < 0 to A a E X =4- a' E X) =4- co C X. 

Proof 

(a) Assume the antecedent. Let Y ~ {x|x E On A (Va)(a^ 0 x =4- a EX)}. It 
is easy to prove that (Va)(a < 0 y =4 a E Y) =4 y E Y. Hence, by Pro- 
position 4.9, On C Y. But F C X. Hence, On C X. 

(b) The proof is left as an exercise. 

(c) This is a special case of part (b), noting that b (V«)(<x <o CO => 
-iLim(a)). 
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PROPOSITION 4.14 

( a ) h (yx) (3 ! F) (Fnc ( F) A 9{ Y) = On A (Vcc) ( Y l a = X' (a f F )) ) . (Given X, 
there is a unique function F defined on all ordinals such that the value 
of Y at a is the value of X applied to the restriction of Y to the set of 
ordinals less than a.) 

(b) F (Vx)(VXi)(VX 2 )(3 1 r)(Fnc(r) A 2{Y) = On A F‘0 = x A (Va)(F‘(a') = 
X x l {Y i a)) A (Va)(Lim(a) =4- F‘a = ^‘(afF))). 

(c) {Induction up to 8.) b (Vx)(VZ,)(VA 2 )(3 1 F)(Fnc(F) A 9{Y) =5 A F‘0 = 
x A (Va)(a' < G 8 => F*(cd) = Aj‘(F‘a)) A (Va)(Lim(a) A a <o <5 F‘a = 

X 2 ‘(af r))). 


Proof 

(a) Let Fi = (i/|Fnc(«) A 3>{u) £ On A (Va)(a £ @>(u) =>• u l a = X'{a f «))}. 
Now, if u\ £ Y\ and u 2 £ Fj, then u\ C u 2 or u 2 C iq. In fact, let 
y x — 2){u\) and y 2 = Ss{u 2 ). Either y x < c y 2 or y 2 say, y x ^ Q y 2 . Let 
w be the set of ordinals a < 0 y x such that u\a ^ u 2 a\ assume w / 0 and 
let t] be the least ordinal in w. Then for all fi < c tj, u\ ‘/? = u 2 (i. Hence, 
u\‘a = r\[u 2 . But u^r} = X l {t]l iq) and u 2 t r} — X l {t] ^ u 2 ); and so, 
u\ i t] = u 2 t}, contradicting our assumption. Therefore, w = 0; that is, for 
all a< G }q, u\o, — w 2 ‘a. Hence, u x = y x [u x = y x iu 2 c u 2 . Thus, any two 
functions in Y\ agree in their co mm on domain. Let Y — (J Fi . We leave 
it as an exercise to prove that F is a function, the domain of which is 
either an ordinal or the class On, and (Va)(a £ 3>(Y)=> Y‘a =X‘(af F)). 
That S)(Y) = On follows easily from the observation that, if 3>(Y) = 8 
and if we let W = Fw{(<5,X‘F)}, then W £ Y x ; so, W CF and 
8 £ @(Y) = 8, which contradicts the fact that 8 £ 8. The uniqueness of 
F follows by a simple transfinite induction (Proposition 4.9). 

The proof of part (b) is similar to that of (a), and part (c) follows from (b). 

Using Proposition 4.14, one can introduce new function letters by 
transfinite induction. 


Examples 

1. Ordinal addition. In Proposition 4.14(b), take 

x—fi X x = {{m, u)|y = i/} X 2 = {{u,v)\v = \^8%{u)} 
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Hence, for each ordinal /?, there is a unique function Yp such that 
Y p % = fi A (Vo:)(F/?‘(cO = (Yp l a)' A [Lim(a) =► Yp l a = (J(E /J tt a)]) 

Hence there is a unique binary function + G with domain (On) 2 such that, for 
any ordinals p and y, + c (/?, y) = Yp l y. As usual, we write p + c y instead of 
+ 0 (/?,y). Notice that: 


P +o 0 = P 
P +o (/) = (P +o y)' 
Lim(a) =>• P + 0 a = (JW+»r) 

T< Q a 


In particular, 

P+ol = P+o (0') = (P +o 0)' = P' 

2. Ordinal multiplication. In Proposition 4.14(b), take 

x = 0 Xi={{u,v)\v~u+ o p} Xi — {(w, = U&(u)} 

Then, as in Example 1, one obtains a function p x 0 y with the properties 

P x o 0 = 0 

P x c (/) = (P x 0 y) +o P 

Lim(o:) => ft x c a = [J (p x c t) 

T<o« 


Exercises 

4.35 Prove: h P x 0 1 = P A p x c 2 = p + G p. 

4.36 Justify the following definition of ordinal exponentiation.^ 

e xp (p, 0) = 1 

exp(j8, /) = exp (P, y) x 0 p 
Lim(c/.) =^exp(/J, a) = U exp (/?, t) 

0<o T <oa 

For any class X, let Ex be the membership relation restricted to X; that is, 
Ex — { (n, v)\u G v f\u G X A t) £ A}. 


l\Ve use the notation cxp(/j, a) instead of ff in order to avoid confusion with the 
notation X Y to be introduced later. 
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PROPOSITION 4.15+ 

Let R be a well-ordering relation on a class F; that is, R We F. Let F be a 
function from Y into F such that, for any u and v in F, if (u, v } £ R, then 
(F‘w,F‘i;) £ R. Then, for all u in Y, u = F l u or (u,F l u) £ R. 


proof 

Let A — {u\{F l u, u) £ /?}. We wish to show that A = 0. Assume X / 0. Since 
JC F and R well-orders F, there is an /('-least element uq of X. Hence, 
(F‘uo,uo) £ R- Therefore (F‘(F‘«o),F‘wo) £ R. Thus, F‘wo £ A, but F £ i / 0 is 
/^-smaller than contradicting the definition of uq. 


COROLLARY 4.16 

If F is a class of ordinals, F: Y — > Y, and F is increasing on F (that is, 
a£FA/?£FAa< 0 /?=^ F £ a < 0 F l (3), then a < 0 F £ a for all a in F. 


Proof 

In Proposition 4.15, let R be Ey. Note that Er well-orders F, by Proposition 
4.8(1) and Exercise 4.25. 


COROLLARY 4.17 

Let a < 0 P andy C a; that is, let y be a subset of a segment of /?. Then (E^, /?) 
is not similar to {Ey,y). 


Proof 

Assume (E p, (3) is similar to (Ej,,y). Then there is a function / from (3 onto y 
such that, for any u and v in (3, u < c v o /‘w < 0 f l v. Since the range of / isy, 
/‘a Ey. Buty C a. Hence /‘a < 0 a. But, by Corollary 4.16, a^ 0 /‘a, which 
yields a contradiction. 


iFrom this point on, we shall express many theorems of NBG in English by 
using the corresponding informal English translations. This is done to avoid writing 
lengthy wfs that are difficult to decipher and only in cases where the reader should be 
able to produce from the English version the precise wf of NBG. 
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(a) For (E a , a) and (Ep,P) are not similar. 

(b) For any a, if / is a similarity mapping of (E a , a) with {E a , a), then / i s 
the identity mapping, that is, f ( p — p for all p < G a. 


Proof 

(a) Since a ^ /?, it follows by Proposition 4.8(d,c) that one of a and R is a 
segment of the other; say, a is a segment of /?. Then Corollary 4. 1 7 tells 
us that (E p,p) is not similar to (E a ,a). 

(b) By Corollary 4.16, /‘/?^ 0 /? for all p < c a. But, noting by Exercise 
4.26(b) that f is a similarity mapping of (E a , a) with (E a , a), we again 
use Corollary 4.16 to conclude that {JYP'^qP for all ft < 0 a. Hence 
P = 0'YifP)>of i P^oP and, therefore, /‘/? = p. 


PROPOSITION 4.19 

Assume that a non-empty set u is the field of a well-ordering r. Then there is 
a unique ordinal y and a unique similarity mapping of (E ? , y) with (/-, u ) . 


Proof 

Let F ~ {(r, w) |vv E u — v A (Vz)(z E u — v =>■ (z, w) ^ r)}. F is a function 
such that, if v is a subset of u and u — v ^ 0, then F‘r is the / -least element of 
u — v. Let X = {{r, w) w) E F }. Now we use a definition by transfinite 
induction (Proposition 4.14) to obtain a function Y with On as its domain 
such that (Va)(T‘oc =X‘(a ( T)). Let W = {a|F“a C u A u — Y ll a ^ 0}. 
Clearly, if a E W and P E a, then p E W. Hence, either W = On or W is some 
ordinal y. (If W ^ On, let y be the least ordinal in On — W.) If a E W, then 
Y‘a = X‘(a l Y) is the r-least element of u~Y u a; so, T‘a E u and, if PEa, 
Y‘a Y 1 Y‘p. Thus, Y is a one-one function on W and the range of Y restricted 
to IF is a subset of u. Now, let h = (FFfT) and f = h; that is, let / be the 
inverse of Y restricted to W. So, by the replacement axiom (R), IF is a set. 
Hence, IF is some ordinal y. Let g = yf Y. Then g is a one-one function with 
domain y and range a subset u\ of u. We must show that uy — u and that, if a 
and P are in y and P < G a, then ( g‘P , g‘oc) E r. Assume a and /? are in y and 
P < 0 a. Then g u p C g u a and, since g l a eu — g“a,g‘a E u — g u p. But g‘P is 
the /'-least element of u — g u p. Hence, ( g i p , g l a) E r. It remains to prove that 
u\ — u. Now, uy = T“y. Assume u — uy ^ 0- Then y E W. But IF = y, which 
yields a contradiction. Hence, u = uy. That y is unique follows from Cor- 
ollary 4. 1 8(a). 




EQUINUMEROSITY. FINITE AND DENUMERABLE SETS 
jExercise 

4.37 Show that the conclusion of Proposition 4.19 also holds when u — 0 
a nd that the unique ordinal y is, in that case, 0. 


PROPOSITION 4.20 

Let R be a well-ordering of a proper class X such that, for each y £X, the 
class of all ^-predecessors of y in X (i.e., the /^-segment inX determined by 
y) is a set. Then R is 'similar’ to Eo„l that is, there is a (unique) one-one 
mapping H of On onto X such that (H l a 1 H L (i) £ R. 

Proof 

Proceed as in the proof of Proposition 4.19. Here, however, W — On; also, 
one proves that M(Y) = X by using the hypothesis that every /?- segment of 
X is a set. (If X — 01{Y) ^ 0, then, if w is the ^-least element of X — &(Y), 
the proper class On is the range of F, while the domain of F is the /^-segment 
of X determined by w, contradicting the replacement axiom.) 

Exercise 

4.38 Show that, if X is a proper class of ordinal numbers, then there is a 
unique one-one mapping H of On onto X such that a £ //‘cc £ 


4.3 EQUINUMEROSITY. FINITE AND DENUMERABLE SETS 

We say that two classes X and Y are equinumerous if and only if there is a 
one-one function F with domain X and range F. We shall denote this by 
X = F. 


DEFINITIONS 

X“T for Fnc, (F) A 3>{F) = X A 8t(F) = Y 

F 

X = F for (3E)(X=F) 

F 

Notice that h (Vx)(Vy)(x = y (3z)(x—>’)). Hence, a wf x =y is predicative 

Z 

(that is, is equivalent to a wf using only set quantifiers). 
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Clearly, if X = Y, then Y = X, where G — F. Also, if X = Y and Y 

F G F\ 

then X =Z, where H is the composition F 2 o F\ . Hence, we have the 

H 

lowing result. 



PROPOSITION 4.21 

(a) YX^X 

(b) \-X=Y^Y=X 

(c) YX=YAY = Z=^X = Z 


PROPOSITION 4.22 

(a) h(J^7AZ^ W AXnZ=®AYnW = Q)) ^ XuZ = YnW 

(b) \-(X9*Y AZS*W)=>XxZ**YxW 

(c) h Xx{y}^X 

(d) h XxY^YxX 

(e) h (X x T) x Z 2 * X x (Y x Z) 

Proof 

(a) Let X = Y and Z = W. Then XkjZ = YuPV, where H — FkjG. 

f g H 

(b) Let X~Y and W. Let 

^ ^ ^ 

H = {(«, £>)|(3*)(3j>)(x EX Ay £ Z Au = (x,y) Av = {F% G‘y))} 

Then X x Z = Y x W. 

H 

(c) Let F — {(w, r}|« £ X A v = ( u,y )}. Then X= X x {y}- 

(d) Let F = {(w, r)|(3*)(3y)(* € X Ay £ Y A u = \x,y) A v = (y,x))}. 

Then X x Y= Y x X. 

F 

(e) Let F = {(m, r)|(3x)(3y)(3z)(.x £ X Ay £ Y A z £ Z A u = {(x,y),z) A v 
= {x, (y,z)))}. Then (X x Y) x Z = X x (Y x Z). 

F 


DEFINITION 

X Y for {u\u: Y -> X} 

X Y is the class of all sets that are functions from Y into X. 








5Exercises 

prove the following. 

4.39 h (VZ)(V7)(3X ] )(3F 1 )(X^X 1 A 7 = F[ AX,n7, = 0) 

4.40 I- 3P{y) = 2 y (Recall that 2 — (0, 1} and 1 = {0}.) 

4*41 (a) h -.M (7) =* X Y = 0 

(b) h (Vx)(Vp)M(x y ) 

,4.42 (a) h X 0 = 1 

(b) hl^l 

(c) (- 7 ^ 0 => 0 y = 0 

4.43 \-X**XM 

4.44 \-X^YAZ^W^X z ^Y w 

4.45 h XnY = 0 => Z* uy ^ Z x x Z y 

4.46 h (Vx)(Vp)(Vz)[(* y ) z ^x yXZ ] 

4.47 h {X x 7; ^J z xf z 

4.48 7 (\/x)(\/R)(R We x => (3a)(jc = a)) 
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We can define a partial order ^ on classes such that, intuitively, X^ 7 if 
and only if 7 has at least as many elements as X. 


definitions 

X<7 for (3Z)(Z CYAX^Z) 

(X is equinumerous with a subclass of 7) 
X -< 7 for J<f A --(X s 7) 

(7 is strictly greater in size than X) 


Exercises 


Prove the following. 

4.49 h X^7 & (X X 7 VX £* 7) 

4.50 hl^7 A -iM(X) => -iM(7) 

4.51 h X< 7 A (3Z)(Z We 7) => (3Z)(Z We X) 

4.52 h (Va)(V/?)(a^/? V /?=^a) [///nt: Proposition 4.8(k).] 


PROPOSITION 4.23 

(a) l-X^XA-.(X -<X) 

(b) l-XC 7=>X^7 

(c) l-X^7A7<Z=^X^Z 

(d) I- X=^7 A 7=^X => X = Y (Bernstein’s theorem) 
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Proof 

(a), (b) These proofs are obvious. 

(c) Assume X==7i A F| C I A 7==Zi A Z\ C Z. Let H be the composition of 
F and G. Then ®{H) C Z AX^®{H). So, X^Z. 

(d) There are many proofs of this nontrivial theorem. The following one 
was devised by Heilman (1961). First we derive a lemma. 

Lemma. Assume Xn7 = 0, XnZ - 0 and 7nZ = 0, and let X~X u Y uZ 
Then there is a G such that XsXu7. F 

G 

Proof. Define a function H on a subclass of X x co as follows; 
((u, k), v) E H if and only if u EX and k € co and there is a function / with 
domain k! such that/‘0 — F‘u and, if j Ek, then f l j € X and /‘(/) — F l {f l j) 
and f l k = v. Thus, #*(<«,(&)) = #'(<«, 1)) = F‘(F‘«) if e X , and 

2)) = F‘(F‘(.F‘tt)) if F‘m and F l (F l u) are inX, and so on. LetX* be the 
class of all u in X such that (3y) (y €co A ( u,y ) 6 A H l ((u,y)) € Z). Let 
7* be the class of all u in X such that (Vy)(y E co A (u,y) E 9A{E) 
=>• H l ((u,y))fZ). Then X = X*u7*. Now define G as follows: @}(G) =X 
and, if u E X *, then G‘u = u, whereas, if u E Y*, then G l u - F‘u. Then 
X = Xu 7. (This is left as an exercise.) 

G 

Now, to prove Bernstein’s theorem, assume X 7j A 7i C 7 A Y~X \A 

Xj C X. Let A = G“7i CI,CX. But An{X t - A) = 0,An(X - X^ = 0 and 
(X — Xi)n(Xi - A) = 0. Also, X = (X-Xi)u(Xi - A)uA , and the com- 
positon H of F and G is a one-one function with domain X and range A. 
Hence, A=X. So, by the lemma, there is a one-one function D such that 

H 

A ^ X l (since (Xi — A) u A = Xi). Let T be the composition of the functions 

H,D and G; that is, T‘u — (Gy(D‘(H l u)). Then X = Y, since X = A and 
A “ X x and X\^Y. T H 

D G 

Exercises 

4.53 Carry out the details of the following proof (due to J. Whitaker) of 
Bernstein’s theorem in the case where X and Y are sets. Let 
X = Y\ A Y\ C Y A Y — X\ A X\ C X. We wish to find a set Z C X such that 

G, restricted to Y — F“Z , is a one-one function of Y — F“Z onto X - Z. [Tf 
we have such a set Z, let H = (Z|F)u((X — Z) £ (7) ; that is, /Tx = F l x for 
x E Z, and = G l x for jc E X — Z. Then X=7.] Let Z = (jc j (3w)(u 

C X Ax E u A G U (Y — F tl u) C X — «)}. Notice that this proof does not 
presuppose the definition of c o nor any other part of the theory of ordinals. 

4.54 Prove: (a) HX ^ Xu 7 (b) b X 7 => -i(7 ^ X) (c) h X 7 A 7<Z 
=^X-^Z 
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PROPOSITION 4.24 Assume and Then: 

(a) YnB = 0 ^XkjA^YvB 

(b ) XxA^YxB 

(c) X A ^Y B if B is a set and -»(X = A = F = 0 A 5 / 0) 


Proof 

(a) Assume X=.Yi C Y and A=B\ C B. Let H be a function with domain 

F G 

X uA such that H‘x — F‘x for x EX, and H‘x = G‘x for x E A —X. Then 
XvA£*H“(XvA) C YvB. 

H 

(b) and (c) are left as exercises. 

PROPOSITION 4.25 

(a) I — i(3/)(Fnc(/) A S){f) — x A M{f) = £P(x)). (There is no function 
from x onto £A(x).) 

(b) hr -< &(x) (Cantor’s theorem) 

Proof 

(a) Assume Fnc(/) A &>{f) = x A &(f) = 8P{x). Let y — ( u\u Ex A 
n^/‘n}. Then y E &{x). Hence, there is some z in x such that /‘z = y. 
But, (Vm)(m E y 44 u E x A f l u). Hence, (Vm)(w € f l z 44 u E x A u 
£f l u). By rule A4, z E f l z 44 z € x A z^/‘z. Since zEx, we obtain 
z€/‘z 44 z/ f% which yields a contradicition. 

(b) Let / be the function with domain x such that f l u = («} for each u in x. 
Then /“ x C E? (x) and / is one-one. Hence, x=v^(x). By part (a), 
x = £A(x) is impossible. Hence, x -< ^(x). 

In naive set theory, Proposition 4.25(b) gives rise to Cantor’s paradox. If 
we let x = V, then V £A(V). But £A(V) C V and, therefore, £A(V) ^ V. 
From V -4 £A(V), we have By Bernstein’s theorem, V 

contradicting V -< PA(V). In NBG, this argument is just another proof that 
V is not a set. 

Notice that we have not proved I- (' Vx) (Vy) (x y V y x) . This in- 
tuitively plausible statement is, in fact, not provable, since it turns out to be 
equivalent to the axiom of choice (which will be discussed in Section 4.5). 

The equinumerosity relation = has all the properties of an equivalence 
relation. We are inclined, therefore, to partition the class of all sets into 
equivalence classes under this relation. The equivalence class of a set x 
would be the class of all sets equinumerous with x. The equivalence classes 
are called Frege— Russell cardinal numbers. For example, if n is a set and 
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x = {«}, then the equivalence class ofx is the class of all singletons {/;} and is 
referred to as the cardinal number l c . Likewise, if and y = {«, v}, then 
the equivalence class of y is the class of all sets that contain exactly two 
elements and would be the cardinal number 2 C ; that is 2 C i s 
{x|(3w)(3z)(w ^ z Ax = {w,z})}. All the Frege-Russell cardinal numbers 
except the cardinal number O c of 0 (which is {0}), turn out to be proper 
classes. For example, V = l c . (Let F l x = {x} for all x. Then V~ l c .) But 
-iM(F). Hence, by the replacement axiom, -iM(l c ). F 


Exercise 


4.55 Prove I — iM(2 c ). 

Because all the Frege-Russell cardinal numbers (except O c ) are proper 
classes, we cannot talk about classes of such cardinal numbers, and it is 
difficult or impossible to say and prove many interesting things about them. 
Most assertions one would like to make about cardinal numbers can be 
paraphrased by the suitable use of =, and -<. However, we shall see later 
that, given certain additional plausible axioms, there are other ways of de- 
fining a notion that does essentially the same job as the Frege-Russell 
cardinal numbers. 

To see how everything we want to say about cardinal numbers can be 
said without explicit mention of cardinal numbers, consider the following 
treatment of the ‘sum’ of cardinal numbers. 


DEFINITION 

X+ C Y for (X x {0})u(F x {1}) 

Note that F 0 ^ 1 (since 1 is {0}). Hence, X x {0} and Y x {1} are disjoint 
and, therefore, their union is a class whose ‘size’ is the sum of the ‘sizes’ of X 
and Y. 

Exercise 
4.56 Prove: 

(a) Ya Y^XF c Y 

(b) \-X**A/\Y**B=>X+ c Y2*A+ c B 

(c) h X+ C Y^Y+ C X 

(d) h M(Y + c Y) M(Y) A M(T) 

(e) YX+ C (Y+ C Z)^{X+ C Y)+ C Z 

(f) h X^Y ^X+ C Z^Y+ C Z 

(g) h X + C X = X x 2 (Recall that 2 is {0, 1 }.) 

(h) hX r+cZ ^X r xX z 

(i) h x = x + c 1 => 2 X +c x = 2 X 
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Remember that oo is the set of all ordinals a such that a and all smaller 
ordinals are successor ordinals or 0. The elements of oo are called finite 
ordinals , and the elements of On — co are called infinite ordinals. From an 
intuitive standpoint, oo consists of 0,1,2, 3,..., where each term in this 
sequence after 0 is the successor of the preceding term. Note that 0 contains 
no members, 1 = {0} and contains one member, 2 — {0, 1} and contains 
two members, 3 = {0, 1 , 2} and contains three members, etc. Thus, it is 
reasonable to think that, for each intuitive finite number n, there is exactly 
one finite ordinal that contains exactly n members. So, if a class has n 
members, it should be equinumerous with a finite ordinal. Therefore, a class 
will be called finite if and only if it is equinumerous with a finite ordinal. 


definition 

Fin(X) for (3a)(a € co A X = a) {X is finite) 


Exercise 
4.57 Prove: 

(a) h Fin(X) M(X) (Every finite class is a set) 

(b) h (Va)(a E oo Fin(a)) (Every finite ordinal is finite.) 

(c) h Fin(X) AI=F=^ Fin(F) 


PROPOSITION 4.26 

(a) h (Va)(a ^ co a = a'). 

(b) b (Va)(V/?)(a € co A a f p => ->(a = /?)). (No finite ordinal is equinu- 
merous with any other ordinal.) 

(c) h (Va)(Vx)(a € co Ax C a -i(a = x)). (No finite ordinal is equinu- 
merous with a proper subset of itself.) 


Proof 

(a) Assume a ^ oo. Define a function / with domain a' as follows: /‘<5 = S' 
if <5 E oo; /‘<5 = <5if<5€a'A<5£; cou{a}; and /‘a = 0. Then a ' <= a. 

(b) Assume this is false, and let a be the least ordinal such that a £ co and 
there is /? f a such that a — f3. Hence, a <o fi. (Otherwise, fi would be a 
smaller ordinal than a and ft would also be in oo, and fl would be 
equinumerous with another ordinal, namely, a.) Let a ^ /?. If a = 0, 
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then / = 0 and (3 — 0, contradicting a (3. So, a 7C 0. Since 
a e ( o ) a = y for some <5 € m. We may assume that (3 = y' for some y. (if 
P G to, then (3 7 C 0; and if p ^ co, then, by part (a), /? ^ (!' and we can 
take /f instead of /?.) Thus, <5' = a = y'. Also, <5 7^ y, since a 7^ 

Case 1. f‘8 = y. Then <5 = y, where £/ = <5f/. 

2. /‘<5 7^ y. Then there is some /i G <5 such that /‘ju = y. Let 
h = (W)-{(fry}}) (J {(vJ‘fyY, that is, let /z‘t=/‘t if x£{5,n}, and 
Hfx = f‘8. Then 5 = y. 

h 

In both cases, b is a finite ordinal smaller than a that is equinumerous 
with a different ordinal y, contradicting the minimality of ot. 


(c) Assume p E (o Ax C P A P = x holds for some (3 , and let a be the least 
such (3. Clearly, a 7^ 0; hence, a = y' for some y. But, as in the proof of 
part (b), one can then show that y is also equinumerous with a proper 
subset of itself, contradicting the minimality of a. 


Exercises 

4.58 Prove: I- (Va)(Fin(a) co). 

4.59 Prove that the axiom of infinity (I) is equivalent to the following 
sentence. 


(*) (3x)((3«)(u G x) A (Vy)(y G x =>- (3z)(z G x Ay c 2 ))) 


PROPOSITION 4.27 

(a) I- Fin(Y) A 7 CJ=j- Fin(T) 

(b) h Fin(Y) => Fin(Yu{y}) 

(c) h Fin ( A) A Fin(T) =► Fin (Yu T) 


Proof 

(a) Assume Fin(Y) AY Cl. Then Y = a, where a € c 0 . Let g =Ylf and 
W = g“Y C a. IF is a set of ordinals, and so, is a well-ordering of IF. 
By Proposition 4.19, (£V, IF) is similar to {Ep,p) for some ordinal [3. 
Hence, W = (3. In addition, (3 ^ 0 <x. (If a < D (3, then the similarity of 
( Ep,(3 ) to {EwiW) contradicts Corollary 4.17.) Since a. E co,p E co. 
From Y = W AW = (3, it follows that Fin(T). 

(b) If y € Y, then Yu{y} = Y and the result is trivial. So, assume y£X. 

From Fin(Y) it follows that there is a finite ordinal a and a function / 
such that a=Y. Let g = /u{(a,y)}. Then a ' = Ju{y}. Hence, 
Fin(AuM). f ' 
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(c) Let Z = {u\u € w A (Vx) (Vy) (V/) (x = a A Fin(y) => Fin(x uy))}. We 

must show that Z = co. Clearly, 0 G Z, for if x = 0, then x — 0 and 
jcuj^ = y Assume that a € Z. Let x = a' and Fin(y). Let w be such that 

f l w = a and let xi = x — {>v}. Then xi = a. Since a € Z, Fin(xi uy). 
But x u y = (xi uj)u{uj. Hence, by part (b). Fin (A uj'). Thus, a ' € Z. 
Hence, by Proposition 4. 1 1 (c), Z = oj. 


definitions 

DedFin(A) for M(A) A (VF)(F cl => ~>(X ^ 7)) 

(X is Dedekind- finite, that is, X is a set that is not cquinumerous 

with any proper subset of itself) 

DedInf(A) for M(A) A ->DedFin(A) 

(X is Dedekind-infinite , that is, X is a set that is equinumerous 

with a proper subset of itself) 


COROLLARY 4.28 

(Vx)(Fin(x) =>■ DcdFin(x)) (Every finite set is Dedekind-finite)^ 

Proof 

This follows easily from Proposition 4.26(c) and the definition of ‘finite’. 


DEFINITIONS 

Inf(A) for ->Fin(A) 

Den (A) for X^m 

Count (A") for Fin (A) V Den (A) 


(A is infinite ) 

(A is denumerable ) 
(A is countable) 


Exercise 
4.60 Prove: 

(a) h Inf (A) A X s* Y Inf( Y) 

(b) h Den (A) A A ^ Y Dcn(F) 

(c) h Den(A) M(A) 

(d) h Count(A) A A = Y Count(F) 

(e) h Count (A) ^ M(A) 


Hhe converse is not provable without additional assumptions, such as the ax- 
iom of choice. 
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PROPOSITION 4.29 

(a) h Inf (X) A X C Y => Inf ( Y) 

(b) I- Inf(X) [nf(Yu{^» 

(c) F DedInf(Af) => Inf(Af) 

(d) F lnf(co) 


Proof 

(a) This follows from Proposition 4.27(a). 

(b) F Inf(Af) => Inf(ATj{v}) by part (a), and F Inf(Yu{y|) =>- lnf(X) by 
Proposition 4.27(b) 

(c) Use Corollary 4.28. 

(d) F co £ co. If Fin(co), then co = a for some a in co, contradicting Propo- 
sition 4.26(b). 


PROPOSITION 4.30 

F (Vr)(Vz)(Den(r) Az C v => Count(z)). (Every subset of a denumerable set 
is countable.) 

Proof 

It suffices to prove that z C co =$■ Fin(z) V Den(z). Assume z C co A -<Fin(z). 
Since -iFin(z), for any a in z, there is some (3 in z with a < 0 (3. (Otherwise, 
z C o' and, since Fin(a'), Fin(z).), Let AT be a function such that, for any a in 
co, X‘a is the least ordinal p in z with a < c p. Then, by Proposition 4.14(c) 
(with S — co), there is a function Y with domain co such that Y% is the least 
ordinal in z and, for any y in co, Y i (/) is the least ordinal p in z with 
P >o Y V* Clearly, Y is one-one, <2>(Y) = co, and T“co C z. To show that 
Den(z), it suffices to show that F“co = z. Assume z — F“co ^ 0. Let S be the 
least ordinal in z - F“co, and let t be the least ordinal in F“co with t > c 5. 
Then % — Y‘a for some cr in co. Since S < 0 t, o =£ 0. So, o — \Y for some fi in 
co. Then t = F‘<r is the least ordinal in z that is greater than F‘ju- hut 
5 > 0 Y l j.i, since t is the least ordinal in F“co that is greater than <5. Hence, 
t^ 0 <5, which contradicts S < Q t. 


Exercises 

4.61 Prove: F Count(Af) A Y C X Count (T). 

4.62 Prove: 

(a) F Fin(AQ => Fin(^(AT)) 

(b) h Fin(Af) A (Vy)(y e X => Fin(y)) =*• Fin(U*) 
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(c) Fin(F) => Fin(X) 

(d) F Fin(^(X)) =» Fin(X) 

(e) h Fin(U^) => Fin(X) A (Vy) ( v € X => Fin(y)) 

(f) F Fin(X) =► (X ^ F V Y =<X) 

(g) F Fin(X) A Inf(F) ^X <Y 

(h) h Fin(X) AFCX=yFxX 

(i) h Fin(X) A Fin(F) => Fin(X x Y) 

(j) h Fin (A) A Fin(F) => Fin(X y ) 

(k) FFin(X)A y£X^X <Xv{y} 

4.63 Define A" to be a minimal (respectively, maximal ) element of Y if and 
only if X G Y and (Vy) (y € Y =y ->{y C X)) (respectively, (Vy)(y € Y => -> 
(X C y ) ) ) . Prove that a set 7 is finite if and only if every non-empty set of 
subsets of Z has a minimal (respectively, maximal) element (Tarski, 1925). 

4.64 Prove: 

(a) F Fin (AT) A Den(F) => Den (Xu F) 

(b) F Fin(X) A Den(F) AX ^ 0 => Den(X x F) 

(c) F ( Vx) [Dedlnf(x) ^ (3y)(y Qx A Den(y))] . (A set is Dedekind-infinite 
if and only if it has a denumerable subset) 

(d) F (Vx)[(3y)(y C x A Den(y)) co jc] 

(e) F (Va)[(a ^ co => Dedlnf(a)) A (Va)(Inf(a) =y a ^ co) 

(f) F (Vx) (Vy) (y ^ x [Dedlnf(x) <=> x = xu{y}]) 

(g) F (Vx)(co x x + c 1 = x) 

4.65 If NBG is consistent, then, by Proposition 2.17, NBG has a denu- 
merable model. Explain why this does not contradict Cantor’s theorem, 
which implies that there exist non-denumerable infinite sets (such as co )). 
This apparent, but not genuine, contradiction is sometimes called Skolem’s 
paradox. 


4.4 HARTOGS’ THEOREM. INITIAL ORDINALS. 
ORDINAL ARITHMETIC 


An unjustly neglected proposition with many uses in set theory is Hartogs’ 
theorem. 


PROPOSITION 4.31 (HARTOGS, 1915) 

F (Vx)(3a)(Vy)(y C x =y -i(a = y)). (For any set x, there is an ordinal that is 
not equinumerous with any subset of x.) 
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Assume that every ordinal a is equinumerous with some subset y of x. 
Hence, y= a for some /. Define a relation r on y by stipulating that (u, v) £ t - 

if and only if pu £ f‘v. Then r is a well-ordering of y such that (r,y) is 
similar to (E a , a). Now define a function F with domain On such that, for 
any a, F l a is the set w of all pairs (z, y) such that y C x, z is a well-ordering of 
y, and (E a ,a) is similar to (z,y). (w is a set, since w C g?(x xx) x ^(x).) 
Since, F u (On) C x xx)x ^(jc)), F u (On ) is a set. F is one-one; hence, 
On = F“(F“(On)) is a set by the replacement axiom, contradicting Propo- 
sition 4.8(h). 


DEFINITION 

Let Xf denote the function with domain V such that, for every jc, yf l x is the 
least ordinal a that is not equinumerous with any subset of jc. {Xf is called 
Hartogs ‘ function.) 


COROLLARY 4.32 

(Vjc) {yOx < 


Proof 

With each fl < D ^‘jc, associate the set of relations r such that r C jc x jc, r is a 
well-ordering of its field y, and (r,y) is similar to (Ep, fl). This defines a one- 
one function from J^‘jc into ^^{x x x). Hence, yOx x jc). By Ex- 
ercise 4.12(s), jc xr C So, SP&ix xjc) C ^^^(jc), and therefore, 

2/e‘x < 

DEFINITION 

Init(Y) for X £ On A {\/(l){fl < 0 X =► Pfl = X)) 

(X is an initial ordinal ) 

An initial ordinal is an ordinal that is not equinumerous with any smaller 
ordinal. 


Exercises 

4.66 (a) h (Va)(a <E co => Init(a)). (Every finite ordinal is an initial ordinal.) 
(b) (- Init(co). [Hint: Use Proposition 4.26(b) for both parts.] 
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4.67 Prove: 

(a) For every x, Xf l x is an initial ordinal. 

(b) For any ordinal a, Xt’a. is the least initial ordinal greater than a. 

(c) For any set x, Jf’bc = co if any only if x is infinite and x is Dedekind- 
finite. [Hint: Exercise 4.64(c).] 

Definition by transftnite induction (Proposition 4.14(b)) yields a function 
G with domain On such that 

G‘0 = co 

G l [a!) = for every a 

G l X = ^J(G“(2)) for every limit ordinal X 


PROPOSITION 4.33 

(a) h (Va)(Init(G £ a) A co^ 0 G l a A {V(3){(3 < D a => G l [3 < 0 G £ a)) 

(b) I- (Va)(a^ 0 G £ a) 

(c) h (V/T)(co ^ o p A Init(/?) => (3a)(G £ a = /?)) 


Proof 

(a) Let X = {a| Init(G £ a) A co^ 0 G £ a A (V/7)(/7 < c a =}► G £ /? < 0 G £ a)}. 

We must show that On C X. To do this, we use the second form of trans- 
finite induction (Proposition 4.13(a)). First, 0 E X, since G% — co. Second, 
assume a EX. We must show that a! EX. Since a EX, G £ a is an infinite 
initial ordinal such that (V/?) (/? < n a => G £ /? < n G £ a). By definition, 
G £ (a') = ^f‘(G‘a), the least initial ordinal > D G £ (a). Assume (3 < c o'. Then 
(3 < 0 a V p — a. If < 0 a, then, since a E X, G l (3 < c G‘a < D G £ (c/). If = ot, 
then G £ /? = G £ a < c G‘(a')- In either case, G l (3 < c G £ (oc'), Hence, a! E X. Fi- 
nally, assume Lim(a) A (V/?)(/( <„ a => (3 E X). We must show that a E X. 
By definition, G‘a = (J(G“(a)). Now consider any (3 < D ot. Since 
Lim(a),^ / < c a. By assumption, ft EX, that is, G l ((3 r ) is an infinite initial 
ordinal such that, for any y < 0 (3', G l y < c G\fi). It follows that G“(a) is a 
non-empty set of ordinals without a maximum and, therefore, by Proposi- 
tion 4.12, G £ a, which is (J(G“(a)), is a limit ordinal that is the least upper- 
bound of G“(a). To conclude that G £ a EX, we must show that G £ a is an 
initial ordinal. For the sake of contradiction, assume that there exist S such 
that d < 0 G‘(a) and <5 = G £ a. Since G £ a is the least upper bound of G ££ (a), 
there must exist some in G“(a) such that S < D fi. Say, /i — G l (3 with (3 < D a. 
So, SC/u = G l (3c G L (f3') C G £ a ^ <5. Since 5 c G £ (/?'), S E G £ (/?') and 
S^G^fi'). On the other hand, since G £ (/? ; ) C G l a = <5, G l ([3') <5. By 

Bernstein’s theorem, <5 ~ G‘(//), contradicting the fact that G l ((3 f ) is an in- 
itial ordinal. 




(b) This follows from Corollary 4.16 and part (a). 

(c) Assume, for the sake of contradiction, that there is an infinite initial 
ordinal that is not in the range of G, and let a be the least such. By part (b)« 

G l c and, by part (a), G l a is an initial ordinal. Since c is not in the range 
of G , <7 < 0 G‘<t. Let n be the least ordinal such that g < c G‘/i. Clearly, /j =£ a 
since G‘0 = co < D g. Assume first that fi is a successor ordinal y'. Then, w 
the minimality of ju, G‘y < D a. Since G‘(y') = je'(G-y), G‘(/) is the feast! 
initial ordinal greater than G‘y. However, this contradicts the fact that a is 
an initial ordinal greater than G l y and g < n G‘(y'). So, fi must be a limit 
ordinal. Since G l /.i — |J(G“(^)), the least upper bound of G“(^), an( j 
g < 0 G l fi, there is some S < n ji such that g < d G‘S < 0 G l fi , contradicting the 
minimality of ju. 

Thus, by Proposition 4.33, G is a one-one < D -preserving function from 
On onto the class of all infinite initial ordinals. 

NOTATION 

co a for G‘a 

Hence, (a) co 0 = co; (b) ov is the least initial ordinal greater than co a ; (c) for a 
limit ordinal A, co;, is the initial ordinal that is the least upper bound of the 
set of all co y with y < c A. Moreover, co a > 0 a for all a. In addition, any 
infinite ordinal a is equinumerous with a unique initial ordinal cop^ 0 oc f 
namely, with the least ordinal equinumerous with a. 

Let us return now to ordinal arithmetic. We already have defined ordinal 
addition, multiplication and exponentiation (see Examples 1-2 on pages 
249-50 and Exercise 4.36). 


PROPOSITION 4.34 

The following wfs are theorems. 

(a) p+o 1 = /S' 

(b) 0+o/? = 0 

(c) 0 < 0 0 ==> (a < 0 cl +o 0 A 0^o a +o 0) 

(d) 0 < 0 y => a + 0 0 <o a + 0 y 

(e) a + 0 ji — cl. -|- 0 S 0 — <5 

(f) a < 0 0 => ( 3 i< 5 )(a + o S — 0) 

(g) 0 7^ x C On => a + 0 U 0 = |J ( a +o 0 ) 

P^x P& 

(h) 0 <o a A 1 < 0 0 => a < 0 a x D 0 

(i) 0 < D a A0 < 0 0 => a< 0 a x c 0 

(j) y +o 0 A 0 <Co ot — cl x c y <Co ot 7 < 0 0 

(k) x C On =+ a x c |J 0 = (J (a x D 0) 

fi€X /?£\ 
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proof 

.(a) P 3- o 1 = P +o (00 = (P +o 0/ = P' 

;j[b) Prove 0 -j- G /? = /? by transfinite induction (Proposition 4.13(a)). Let 
X = {p\® -\- 0 P = P}- First, 0 E X, since 0 + 0 0 = 0- If 0 +o y = y, then 
0 4 - c y — (0 + 0 y)' = y'. If Lim(a) and 0 3- 0 t = t for all x < 0 a, then 
0 +o a = U (0 +o T ) ~ U x — a, since U t is the least upper bound of 

T< 0 tt f< 0 a T< 0 a 

the set of all t < c a, which is a. 

(c) Let X = {P 1 0 <o P => a <o a To P\- Prove X = On by transfinite in- 

duction. Clearly, 0 eA\ If y E X, then a^ 0 a. + 0 T, hence 
a< 0 «+oy<o ( a +o y)' — a +o /• If Lim(A) and x E X for all x < c A, 
then a< 0 a' = a+ 0 l< 0 U (a + 0 t) = a + 0 A. The second part is left 
as an exercise. T< °* 

(d) Lei X = {y\ (Va)(V/?) {P <o y ^ u+o P <o a +o y) } and use transfinite 
induction. Clearly, 0 E X. Assume y E X and p < 0 y'. Then P < 0 y or 
P = y- If P <o y then, since y € X, a + 0 p < D a + 0 y < a (a +o y)' 
= a +o /■ If P = y, then a + c P = a+ D f <o (a +o y )' — a + 0 /• Hence, 
/6l. Assume Lim (A) and t E X for all x < c A. Assume p < D A. Then 

P<oT for some r < c A, since Lim(A). Hence, since x EX, 

a +o P <o « +o r<oUT< c i( a +o t) = a + D A. Hence, AeX. 

(e) Assume a + 0 p — a + 0 <5. Now, either p < D 8 or <5 < D P or S — p. If 
P < o <5. then a + 0 /? < 0 a + 0 ^ by part (d), and, if & <o P, then 
a +o <5 <o a To by part (d); in either case, we get a contradiction with 
x+o P = a-\- 0 d. Hence, 8 = p. 

(f) The uniqueness follows from part (e). Prove the existence by induction 
on p. Let A" = {/?|oc < c p => (3i<5)(a + 0 8 — P)}. Clearly, 0 E X. Assume 
y E X and a < 0 /. Hence, a — y or a < 0 y. If a = y, then 
(3<5)(a + 0 8 — y f ), namely, <5=1. If a < f) y, then, since y E X, 
(3i<5)(a + 0 8 = y). Take an ordinal o such that 0 L-\- o cr = y. Then 
a + 0 o' — (a +o &)' = /; thus, (3<5)(a + 0 8 = y'); hence, y' E X. Assume 
now that Lim (A) and tel for all x < c A. Assume a < 0 A. Now define a 
function / such that, for a < D p < c A ,/‘p is the unique ordinal 5 such 
that a+ 0 5 = /i. But A = U a<o/t<0 A^ = Ua< D ,« 0 ;X a +<>./»• Let 
P = U«<, (/TO- Notice that, if a < D p < c A, then /‘p < 0 /‘(p'); 
hence, p is a limit ordinal. Then A = U a < 0 «< 0 A (a+ 0 /‘p) = 

Utr< 0 p( a +o<7) =a+oP- 

(g) Assume 0 y x C 0«. By part (f), there is some 8 such that 
a + 0 8 = U«er( a +<> P)- We must show that 8 = (L er /?. If p E jc, then 
a+o]S^o«+o^ Hence, p^ 0 8 by part (d). Therefore, <5 is an upper 
bound of the set of all /? in x. So, U/?er P^od- On the other hand, if 
p ex, then a + 0 p < D a + D p. Hence, a + 0 8 = (Jp €JC (a +o P) 

^o“+oU p&P- Hence, a + 0 5 = U/fec( a +° P) +o Upe*/* and so ’ 

by part (d), 8^ 0 \J p& p. Therefore, 8 = jj p& P- 

(h) -(k) are left as exercises. 
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AXIOMATIC SET THEORY 


PROPOSITION 4.35 The following wfs are theorems. 



(a) p x 0 l=(J/\lx 0 (] = p 

(b) 0x o /? = 0 

(c) (ot -H 0 /?) +o y = a -Ho {P +o y) 

(d) (a x 0 p) x 0 y = a x D (P x c y) 

(e) a x D (p + 0 y) = (a x c p) + D (a x 0 y) 

(f) exp(/?, 1) = /?Aexp(l,0) = 1 

(g) exp(exp(/?, y), <5) = exp(/?, yx c 5) 

(h) exp (P, y + 0 5) = exp (p, y) x G exp(p, <5) f 

(i) a > 0 l A P < 0 y =H exp (a, p) < D exp(a, y) 


Proof 

(a) p x D 1 = p x 0 0' = (P x 0 0) + 0 p = 0 -y 0 p = p, by Proposition 4.34(b). 

Prove 1 x 0 p = p by transfinite induction. 

(b) Prove 0 x c p = 0 by transfinite induction. 

(c) Let X={y | (Va)(V/?)((a /?) + 0 y = a + D (/? + 0 y))}. 0 6l, since 
(a +o P) +o 0 = (a + 0 P) — a +o (P +o 0). Now assume y eX. Thai 

(a + 0 p) + 0 y' = ((a + D p) + c y)' = (a + D (p + 0 y))' = a + 0 (p + 0 y)' == 
a +o (/? +o y ')■ Hence, y' e X. Assume now that Lim(A) and r e X for 
all t < D A. Then (a + 0 p ) -y D A = U T < 0 A(( a "ho P) -Ho t) = U T <^(a-H 0 
(P +ot)) =a+o Ur< 0 A {P +o t), by Proposition 4.34(g), and this is equal 
to a T - 0 ( P -Ho A). 

(d) -(i) are left as exercises. 

We would like to consider for a moment the properties pf ordinal addi- 
tion, multiplication and exponentiation when restricted to co. 


PROPOSITION 4.36 

Assume a, /?, y are in co. Then: 


(a) a -H 0 p E co 

(b) a x D p 6 co 

(c) exp(a, P) E co 

(d) ot -Ho p = p -Ho a 

(e) a x 0 p = p x 0 a 

(f) (a -Ho P ) x c y = (ot x c y) -h d (P x d y) 

(g) exp(a x Q p, y) = exp(a, y) x D exp (P, y) 


tin traditional notation, the results of (f)— (h) would be written as p ] = p, 

P = l, (p y ) 6 = py x °\py+° s = py * 0 p 6 . 
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Proof 

(a) Use induction up to co (Proposition 4.13(c)). Let X = mp G co/\ 
(Va)(a € co) a + D (3 6 co)}. Clearly, 0 Gl. Assume J. Consider 
any a € co. Then a + c /? € co. Hence, a + D /J' = (a + D /?)' € co by Prop- 
osition 4.1 1(a). Thus, El. 

(b) and (c) are left as exercises. 

(d) Lemma, h a E co A /? E co => a' + c = a + D // Let Y — {(3 \ (3 E co 
A(Va) (a E co => a' + D /? = a + c /?')}. Clearly, 0 E F. Assume /? E Y. 
Consider any a E co. So, a' + D /3 = a + 0 ft . Then a' -|- 0 ft = (a' -f- 0 /?) ; = 
(a + 0 ft)' = a + 0 05')'- Hence, /?' E F. 

To prove (d), let X — {(3 \ (3ecoA (Va)(a Effl^a-(- 0 /l = j?+o a)}. 
Then 0 E X and it is easy to prove, using the lemma, that (3 E X ft EX. 

(e) -(g) are left as exercises. 

The reader will have noticed that we have not asserted for ordinals cer- 
tain well-known laws, such as the commutative laws for addition and 
multiplication, that hold for other familiar number systems. In fact, these 
laws fail for ordinals, as the following examples show. 

Examples 

1. (3a) (3 (3){a -\- 0 (3 ^ (3 -\- 0 a) 

1 +0 0 ) = (J (1 + 0 a) = co 

a< 0 to 

CO +0 1 = CO r >0 CO 

2. (3a)(3j?)(a x D (3 ^ (3 x c a) 

2 x c co = M (2 x 0 a) = co 

a< 0 co 

CO X 0 2 = 0) x 0 (1 -f 0 1) = (co x 0 1) +0 (co x 0 1) = CO + 0 (!) > 0 CO 

3. (3a)(3/?)(3y)((a + D /?) x D y ± (a x D y) + 0 ( (3 x D y)) 

(1 +0 1) X 0 CO = 2 X 0 CO — CO 
(1 x 0 co) + 0 (1 x 0 co) — CO +0 CO >0 CO 

4. (3a)(3£)(3y)(exp(a x D (3, y) ^ exp(a, y) x D exp (/?, y)) 

exp(2 x 0 2, co) = exp(4, co) = exp(4, a) = co 

a<o 03 

exp(2, co) = M exp(2, a) = co 

a< 0 co 

So, exp (2, co) x 0 exp(2, co) — co x 0 co > D co. 

Given any wf 3ft of formal number theory S (see Chapter 3), we can 
associate with J a wf of NBG as follows: first, replace every ‘ ’ by ‘+ 0 ’, 

every *•’ by ‘x c ’, and every ‘//(f)’ by ‘t U then, if 3ft is # =s- 2 or 

Un abbreviated notation for S, ‘// (f)’ is written as f, and in abbreviated no- 
tation in NBG, 7 U {/■}’ is written as A So, no change will take place in these 
abbreviated notations. 




respectively, and we have already found and 23* , let 3d* be <3* => £%* Qr 
33* , respectively; if 3d is (Vx)^(x), replace it by (Vx)(x € co => <3* (*))- This 
completes the definition of 3d*. Now, if xj , . . . ,x„ are the free variables (if 
any) of 38, prefix (xi € co A . . . A x„ € co) => to 38*, obtaining a wf 3d#. This 
amounts to restricting all variables to co and interpreting addition, multi- 
plication and the successor function on natural numbers as the corre- 
sponding operations on ordinals. Then every axiom 3d of S is transformed 
into a theorem 3d# of NBG. (Axioms (S1)-(S3) are obviously transformed 
into theorems, (S4)# is a theorem by Proposition 4.10(c), and (S5)#-(S8)# 
arc properties of ordinal addition and multiplication.) Now, for any wf 3d of 
S, 38# is predicative. Hence, by Proposition 4.4, all instances of (S9)# are 
provable by Proposition 4.13(c). (In fact, assume ^#(0) A (Vx)(x e to 
=> (^#(*) => -^#(*'))))* Let X = {y I y € to A -^#(.y)}. Then, by Proposi- 
tion 4.13(c), (Vx)(x € to => 3djf(x)).) Applications of modus ponens 
are easily seen to be preserved under the transformation of 3d into 3d#. 
As for the generalization rule, consider a wf 3d(x) and assume that 
3d#(x) is provable in NBG. But 3d-#[x) is of the form 
JtEtoAyi 6 co A . . . Ay„ G co =y 3d*(x). Hence, yi € co A . . . A y„ E co => (Vx) 
(x € to => 3d* (x) ) is provable in NBG. But this wf is just ((Vx)3d(x))#. Hence, 
application of Gen leads from theorems to theorems. Therefore, for every 
theorem 3d of S, 3d# is a theorem of NBG, and we can translate into NBG 
all the theorems of S proved in Chapter 3. 

One can check that the number-theoretic function h such that, if x is the 
Godel number of a wf 3d of S, then h(x) is the Godel number of 3d#, and if x 
is not the Godel number of a wf of S, then h(x) = 0, is recursive (in fact, 
primitive recursive). Let K. be any consistent extension of NBG. As we saw 
above, if x is the Godel number of a theorem of S, then h(x) is the Godel 
number of a theorem of NBG and, hence, also a theorem of K. Let S(K) be 
the extension of S obtained by taking as axioms all wfs 38 of the language of 
S such that 3d# is a theorem of K. Since K is consistent, S(K) must be 
consistent. Therefore, since S is essentially recursively undecidable (by 
Corollary 3.46), S(K) is recursively undecidable. Now, assume K is recur- 
sively decidable; that is, the set Tk of Godel numbers of theorems of K is 
recursive. But CV s(lt (x) = C TK (h(x)) for any x, where C TsiK] and C Tk are the 
characteristic functions of T S { K ) and 7k- Hence, T S( k) would be recursive, 
contradicting the recursive undecidability of S(K). Therefore, K is recur- 
sively undecidable, and thus, if NBG is consistent, NBG is essentially re- 
cursively undecidable. Recursive undecidability of a recursively 
axiomatizable theory implies incompleteness (see Proposition 3.47). Hence, 
NBG is also essentially incomplete. Thus, we have the following result: if 
NBG is consistent, then NBG is essentially recursively undecidable and es- 
sentially incomplete. (It is possible to prove this result directly in the same 
way that the corresponding result was proved for S in Chapter 3.) 
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Exercise 

4.68 Prove that a predicate calculus with a single binary predicate letter is 
recursively undecidable. [Hint: Use Proposition 3.49 and the fact that NBG 
has a finite number of proper axioms.] 

There are a few facts about the ‘cardinal arithmetic’ of ordinal numbers that 
we would like to deal with now. By ‘cardinal arithmetic’ we mean properties 
connected with the operations of union (U), Cartesian product (x) and X y , 
as opposed to the properties of + D , x c and exp. Observe that x is distinct 
from x D ; also notice that ordinal exponentiation exp(a, / 3 ) has nothing to do 
with X r > the class of all functions from Y into X. (From Example 4 on page 
269 we see that exp( 2 , co) is co, whereas, from Cantor’s theorem, co -<: 2°\ 
where 2 “ is the set of functions from co into 2 . 


PROPOSITION 4.37 

(a) b co x co = co 

(b) b 2 ^JA 2 ^f=>JUf^Ixy 

(c) b Den(x) A Den(y) =y Den(x Uy) 

Proof 

(a) Let / be a function with domain co such that, if a € co, then /‘a = (a, 0). 
Then / is a one-one function from co into a subset of co x co. Hence, 
co=^co x co. Conversely, let g be a function with domain co x co such 
that, for any (a, p) in co x co, g l (cc, p) — exp( 2 , a) x c exp(3, p). We leave 
it as an exercise to show that g is a one-one function from co x co into 
co. Hence, co x co=^co. So, by Bernstein’s theorem, co x co = co. 

(b) Assume a\ EX,aj EX,ai ^ aj,b\ € Y,bj £ Y,b\ 7 ^^. Define 

if* ex 

(oi , x) if x e Y — X and x ^b\ 

(° 2 ) bj) if x — b\ and x e Y — X 

Then f is a one one function with domain XUY and range a subset of 

X x Y. Hence, XuY X x Y. 

(c) Assume Den(x) and Den(y). Hence, each of x and y contains at least 

two elements. Then, by part (b), x U x y. But x = co and y — co. 

Hence, Jt x y = co x co. Therefore, x U y^co x co = co. By Proposition 
4.30, either Den (x U y) or Fin (x U y). But x C x U y and Den(x); hence, 
-iFin(x U y). 

For the further study of ordinal addition and multiplication, it is quite 

useful to obtain concrete interpretations of these operations. 
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PROPOSITION 4.38 (ADDITION) 

Assume that ( r,x ) is similar to (E«, a), that (5,3/) is similar to (E p , fi), and that 
xny=0. Let t be the relation on x U y consisting of all {u,v) such that 
(ti,v) E x x y or u E x A v E x A (u, v) E r or u E y A v E y A (u, d) Es (that is, 
t is the same as r in the set x, the same as s in the set y, and every element of* 
^-precedes every element of y). Then t is a well-ordering of x U y, and 
(/,rUy) is similar to (E a+oP , a + D fi). 

Proof 

First, it is simple to verify that ms a well-ordering of * Uy, since r is a well- 
ordering of* and s is a well-ordering of y. To show that (t,x U y) is similar to 
(E a+o 0 ,a-f- o fi), use transfinite induction on fi. For /? = 0, y = 0. Hence, 
t = r,x Uy = *, and a + 0 fi = a. So, (t, a Ufi) is similar to (E a+oP , a + 0 fi) . 
Assume the proposition for y and let /? — y'. Since (s,y) is similar to (Ep , fi ) , 
we have a function / with domain y and range fi such that, for any u, u in 
y, (u, v) E s if and only if f l u E f l v. Let b = (/)*}’, let yi = y — {£} and let 
= 5 fl (yi x yj ) . Since b is the ^-maximum of y, it follows easily that si 
well-orders y\. Also, y\ if is a similarity mapping of y\ onto y. Let 
/] = zn ((*Uyi) x (* Uyi)). By inductive hypothesis, (t\ ,x U yi) is similar to 
(E a+o? , a + 0 y), by means of some similarity mapping g with domain * Uy\ 
and range a + 0 y. Extend g to g\ = g U {( 6 , a + 0 y)}, which is a similarity 
mapping of * Uy onto (a +0 y) — a +0 i — a +0 fi- Finally, if Lim (fi) and 
our proposition holds for all t < d fi, assume that / is a similarity mapping 
of y onto fi. Now, for each t < 0 fi, let y T = = s D (y T x y T ), and 

t T — t fi ((* Uy T ) x (* U y T )). By inductive hypothesis and Corollary 4.18(b), 
there is a unique similarity mapping g x of (£ T ,*Uy T ) with (E aHoT , a + 0 x); 
also, if T] < 0 %2 <0 fi , then, since (x Uy Tl )lg Tl is a similarity mapping of 
(t T] , * U y T] ) with (E a+oTl , a -ho tj) and, by the uniqueness of g u , (* Uy r] ) 
lg T2 = g ri ; that is, g Z2 is an extension of g Xi . Hence, if g= {J T<oP g x and 
X = [J z<o Ja + 0 t), then g is a similarity mapping of (t, U T < 0 /j( JC u y T )) with 
{Ei,X). But, U T<0 p( x Uy T ) = xUy and U r < 0 /?( a -Ho t) = a -f 0 fi- This com- 
pletes the transfinite induction. 

PROPOSITION 4.39 (MULTIPLICATION) 

Assume that (r,x) is similar to (E a , a) and that (s, yj is similar to (E p , fi). Let 
the relation t on * x y consist of all pairs ( {u, v ) , (w, z) ) such that u and w are 
in * and v and z are in y, and either (r, z) E s or (v — z A (u, vv) E r). Then t is 
a well-ordering of * x y and (t,x x y) is similar to ( E aXoP , a x c fi)J 

tThe ordering t is called an inverse lexicographical ordering because it orders 
pairs as follows: first, according to the size of their second components and then, if 
their second components are equal, according to the size of their first components. 
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Proof 

This is left as an exercise. Proceed as in the proof of Proposition 4.38. 

Examples 

L 2 x c co = co. Let (g*) = (E2,2) and { s,y ) - (E M ,co). Then the Cartesian 
product 2 x co is well-ordered as follows: (0,0), (1,0), (0,1), (1,1), 
(0, 2), (1, 2), ... , (0,«), (1, n), (0, n + 1), (l,w + 1),... 

2. By Proposition 4.34(a), 2=1'= 1 + 0 1- Then by Proposition 4.35(e,a), 
co x 0 2 = (co x D 1) + 0 (co x D 1) — co + c co. Let (r,x) = (E w , co) and 
(j,y) = (E 2 , 2 ). Then the Cartesian product co x 2 is well-ordered as 
follows: (0,0), (1, 0), (2, 0), . . . , (0, 1), (1, 1), (2, 1), . . . 


PROPOSITION 4.40 


For all a, co a x co a = co a . 


Proof 

(Sierpinski, 1958) Assume this is false and let a be the least ordinal such that 
co K x coa = co a is false. Then cop x cop = cop for all /? < D a. By Proposition 
4.37(a), a > 0 0. Now let P — co a x co 0 and, for (3 < D co a , let Pp = {(y, <5)|y 
+ 0 «5 = /?}. First we wish to show that P~\Jp <oW *Pp. Now, if 
y+o <5 = (3 < 0 then y^ 0 (3 < o and S^ 0 (3 < D co a ; hence, (y,<5) E 

xco a = P. Thus, U/j< o 0 j a E /3 CP. To show that P C U/?< 0 w a E/j, it suffices to 
show that, if y < D co a and <5 < c co a , then y + D <5 < D co a . This is clear when y or 
<5 is finite. Hence, we may assume that y and S are equinumerous with initial 
ordinals co c < 0 y and co p ^ 0 S, respectively. Let ( be the larger of a and p. 
Since y < 0 co a and S < 0 co a , then co^ < D co a . Hence, by the minimality of 
a, co( x co( = <x>£. Let r = yx {0} and y— S x {1}. Then, by Proposition 
4.38, xUy=y+ 0 <5. Since y = co a and S = co p ,x = co a x {0} and 

y=co p x {1}. Hence, since jcfly = 0 ,jc Uy ^ (co ff x {0}) U ( co p x {1}). But, 
by Proposition 4.37(b), (co a x {0}) U ( co p x { 1 } ) ^ (co CT x {0}) x(co p x 
1}) = co a x co p =^co( x co^~ cof. Hence, y -|- 0 S^co^ < D co a . It follows that 
y + 0 S < 0 co a . (If co a ^ c y +o <5, then co a ^co(. Since co^ < D co a , co^co a . So, by 
Bernstein’s theorem, co a = co f, contradicting the fact that co a is an initial 
ordinal.) Thus, P = {Jp <o0)a Pp- Consider Pp for any (3 < 0 co a . By Proposition 
4.34(f), for each y< 0 /?, there is exactly one ordinal <5 such that y + D <5 = /?. 
Hence, there is a similarity mapping from onto Pp, where Pp is ordered 
according to the size of the first component y of the pairs (y, S). Define the 
following relation R on P. For any y < 0 co a , S < D co a , p < D co a , v < 0 (o K , 
((y, S), (p, v)) E R if and only if either y ~F 0 <5 < 0 p + 0 v or (y -|- 0 S = 
p + 0 v A y < 0 p). Thus, if /?] < D (i 2 <o co a, then the pairs in Pp x 7?- precede the 
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pairs in Pp 2 , and, within each Pp, the pairs are R-ordered according to the size 
of their first components. One easily verifies that R well-orders P. Since 
P — ay x co a , it suffices now to show that (R, P) is similar to (E Wrj ,oy). gy 
Proposition 4.19, { R,P ) is similar to some (Ef,£), where £ is an ordinal’ 1 
Hence, P = £. Assume that £ > D w a . There is a similarity mapping / bet- 
ween (Ef , 0 and <R, P). Let b = /‘(w a ); then b is an ordered pair (y, S) with! 
y <o w a ,S < 0 w a , and w a If is a similarity mapping between (E^,^) ands 
the R-segment Y — Seg^(R, (y, <5)) of P determined by (}>, 5). Then Y Si co*. If 
we let P = y + c <5, then, if (<?, p) 6 T,we have o + 0 p^ D y -y 0 5 = fj; hence, 
and p^ o P- Therefore, FC^x^. But < D <n a . Since p is obviously 
not finite, p' = w fl with p < 0 a. By the minimality of a, co ;i x a) fl = ox,. So, 
« a = 7=^co /( , contradicting w yi -< co K . Thus, £ < 0 oj„ and, therefore, P=^<y 
Let h be the function with domain ay such that h ( p = (/?, 0) for every 
p < 0 w a . Then h is one-one correspondence between w a and the subset 
w a x {0} of P and, therefore, co a ^P- By Bernstein’s theorem, w a p, 
contradicting the definition of a. Hence, wp x wp = cap for all p. 


COROLLARY 4.41 

If x = w a and y = cop, and if y is the maximum of a and P, then jcxjs ay 
and x Uy = w y . In particular, w a x cop = ay . 


Proof 

By Propositions 4.40 and 4.37(b), U x y ay x wp^w y x ay ^ 

ay. Hence, by Bernstein’s theorem, x x y = co y and rUy = co y . 

Exercises 

4.69 Prove that the following are theorems of NBG. 

(a) X^CtJa => X U co K = w a 

(b) Wa +C W a = W a 

(c) 0 ^ X^Wa X X Wa = w a 

(d) 0 ^ x -< w (w a )' v = W a 

4.70. Prove that the following are theorems of NBG 

(a) 0>{Wa) X &(Wa) = W a ) 

(b) X^^(Wa) =FrU 0>{Wa) = ^{w a ) 

(c) 0 ^ X^^{Wa) =>■ x X ^(C0 a ) ^ &{Wa) 

(d) 0 ^ => (^(Wa)) V = 0>(cOa) 

(e) 1 -< X^Wa =Ytf° a = (w a ) 03 ’ = (^(w a )) W5 = ^(Wa) 

4.71 Assume y yb 0 a y = y+ c y. (This assumption holds for y=w a by Cor- 
ollary 4.41 and for y = ^(w a ) by Exercise 4.70(b). It will turn out to hold 
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for all infinite sets y if the axiom of choice holds.) Prove the following 
properties of y. 

(a) Inf(v) 

(b) y — 1 +c y 

(c) (3«)(3r)(y — u\Jv riuC\v = % /\u=y /\v = y) 

(d) {z\ z £ y A z — t} - 

(e) (z\z C y A Inf(z)} “ 0>{y) 

(f) (3/) (y=y A (Vm) (u e y=> p u f u)) 

4.72 Assume xj^A 1 -< y. (This holds when y = to a by Proposition 

4.40 and for y = ^(w a ) by Exercise 4.70(a). It is true for all infinite sets y if 
the axiom of choice holds.) Prove the following properties of y. 

(a) y^y +cy 

(b) D Let Perm(y) denote {f\y=y\. Then Perm(y) = ^(y). 


4.5 THE AXIOM OF CHOICE. THE AXIOM OF REGULARITY 

The axiom of choice is one of the most celebrated and contested statements 
of the theory of sets. We shall state it in the next proposition and show its 
equivalence to several other important assertions. 


PROPOSITION 4.42 

The following wfs are equivalent. 

(a) Axiom of choice (AC). For any set x, there is a function / such that, for 
any non-empty subset y of x, f l y € y. (f is called a choice function for 
x.) 

(b) Multiplicative axiom (Mult). If x is a set of pairwise disjoint non-empty 
sets, then there is a set y (called a choice set for x) such that y contains 
exactly one element of each set in x: 

(Vu)(u Ex=>i/^0A (Vr>)(t> G x A v f u =>- v n u — 0)) =$• 

(3y)(Vi/)(« G x =>- (3iu>)(u; G u Ply)) 

(c) Well-ordering principle (WO). Every set can be well-ordered: 
(V*)(3y)0 We*). 

(d) Trichotomy (Trich). (Vx)(Vy)(x=^y V j>^x)^ 

t'fhis is equivalent to (Vx)(Vy)(x -< y V x ~y Vy ^ x), which explains the name 
‘trichotomy’ for this principle. 
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(e) Zorn's Lemma (Zorn). Any non-empty partially ordered set x, in which- 
every chain (i.e., every totally ordered subset) has an upper bound, has 
a maximal element: 



(Vx) (Vy) ( [(y Part x) A (Vw)(w C x A y Tot u =>- 
(3v)(v 6 x A (W)(w 6 u =>- w — v V <w, r) e y)))] =>- 
(3r)(r 6 x A (Vvv)(vr G x (v, w)£ y))) 


Proof 

1 . h WO =y Trich. Given sets x and y, then, by WO, x and y can be ’well- 
ordered. Hence, by Proposition 4.19, x ~ a and y = P for some ordinals a 
and fi. But, by Exercise 4.52, a^p or p^oc. Therefore, x=^y or y=^x. 

2. E Trich => WO. Given a set x, Hartogs’ theorem yields an ordinal a 
such that a is not equinumerous with any subset of x, that is, a^x is false. 
So, by Trich, x^a, that is, x is equinumerous with some subset y of a. Hence 
by translating the well-ordering E,, of y to x, x can be well-ordered. 

3. h WO=> Mult. Let x be a set of non-empty pairwise disjoint sets. By 
WO, there is a well-ordering R of (Jx. Hence, there is a function / with 
domain x such that, for any u in x, f'u is the /?-least element of u, (Notice 
that u is a subset of (Jx.) 

4. h Mult =y AC. For any set x, we can define a one-one function g such 
that, for each non-empty subset u of x, g l u = u x {u\. Let x\ be the range of 
g. Then Xj is a set of non-empty pairwise disjoint sets. Hence, by Mult, there 
is a choice set y for xi . Therefore, if u is a non-empty subset of x, then 
u x {«} is in xi, and so y contains exactly one element (v, u ) in u x {«}. Then 
the function / such that f l u = v is a choice function for x. 

5. F AC Zorn. Let y partially order a non-empty set x such that every 
y-chain in x has an upper bound in x. By AC, there is a choice function / for 
x. Let b be any element of x. By transfinite induction (Proposition 4.14(a)), 
there is a function F such that F% = b and, for any a > 0 0,F < a is /‘w, where 
u is the set of y-upper bounds v in x of F“a such that v ^ F u a. Let P be the 
least ordinal such that the set of y-upper bounds in x of F“P that are not in 
F“p is empty. (There must be such an ordinal. Otherwise, F would be a one- 
one function with domain On and range a subset of x, which, by the re- 
placement axiom R, would imply that On is a set.) Let g — PIF. Then it is 
easy to check that g is one-one and, if a < G y < Q P, (g‘a, g l y) € y. Hence, g u p 
is ay-chain inx; by hypothesis, there is ay-upper bound w of g u p. Since the 
set of y-upper bounds of F Li p{= g“P) that are not in g u p is empty, w € g u P 
and w is the only y-upper bound of g“p (because a set can contain at most 
one of its y-upper bounds). Hence, w is a y-maximal element. (If (vv, z) Gy 
and z €x, then z is a y-upper bound of g u p, which is impossible.) 
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. 6. F Zorn => WO. Given a set z, let X be the class of all one-one functions 
with domain an ordinal and range a subset of z. By Hartogs’ theorem, X is a 
set. Clearly, 0 6 X. X is partially ordered by the proper inclusion relation C. 
Given any chain of functions in X, of any two, one is an extension of the 
other. Hence, the union of all the functions in the chain is also a one-one 
function from an ordinal into z, which is a C-upper bound of the chain. 
Hence, by Zorn, X has a maximal element g, which is a one-one function 
from an ordinal a into z. Assume z — g u a f 0 and let b e z — g li a. Let 
f ~ g U {< a, /? >}. Then / € X and g c /, contradicting the maximality of 
g. So, g“a = z. Thus, a = z. By means of g, we can transfer the well-ordering 

E a of a to a well-ordering of z. 


Exercises 

4.73 Show that each of the following is equivalent to the axiom of choice. 

(a) Any set x is equinumerous with some ordinal. 

(b) Special case of Zorn’s lemma. If x is a non-empty set and if the union of 
each non-empty c -chain in x is also in x, then x has a c-maximal 
element. 

(c) Hausdorjf maximal principle. If x is a set, then every c -chain in x is a 
subset of some maximal c-chain in x. 

(d) Teichmuller—Tukey Lemma. Any set of finite character has a c-maximal 
element. (A non-empty set x is said to be of finite character if and only 
if: (i) every finite subset of an element of x is also an element of x; and 
(ii) if every finite subset of a set y is a member of x, then y G x.) 

(e) (Vx)(Rel(x) (3j)(Fnc(y) A 3)(x) — S){y) A y Cx)) 

(f) For any non-empty sets x and y, either there is a function with domain x 
and range y or there is a function with domain y and range x. 

4.74 Show that the following finite axiom of choice is provable in NBG: if x 

is a finite set of nonempty disjoint sets, then there is a choice set y for x. 

[Hint: Assume x = a where a 6 co. Use induction on a.] 


PROPOSITION 4.43 

The following are consequences of the axiom of choice. 

(a) Any infinite set has a denumerable subset. 

(b) An infinite set is Dedekind-infinite. 

(c) If A is a denumberable set whose elements are denumerable sets, then 
(Jx is denumerable. 
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Proof 

Assume AC. 

(a) Let x be an infinite set. By Exercise 4.73(a), x is equinumerous with 
some ordinal a. Since x is infinite, so is a. Hence, co< 0 a; therefore, oj is 
equinumerous with some subset of x. 

(b) The proof is by part (a) and Exercise 4.64(c). 

(c) Assume x is a denumerable set of denumerable sets. Let f be a~ 
function assigning to each u in x the set of all one-one correspondences 
between u and co. Let z be the union of the range of /. Then, by AC applied 
to z, there is a function g such that g'v £ v for each non-empty v C z . In 
particular, if u 6 X, then g l (f l u) is a one-one correspondence between u and 
co. Let h be a one-one correspondence between co and x. Define a function F 
on (Jx as follows: let y € (J x and let n be the smallest element of <o such that 
y £ Fn. Now, Fn £ x; so, g L {f l {Fn)) is a one-one correspondence between 
Fn and co. Define F l y = («, ( g‘{f l {h l ri))) L y ). Then F is a one-one function 
with domain (Jx and range a subset of co x co. Hence, (Jx^co x co. But 
cox co = co and, therefore, (Jx^co. If v £ x', then d C |Jx and v ^ co. Hence, 
co=^ (J*- Bernstein’s theorem, (Jx = co. 


Exercises 

4.75 If x is a set, the Cartesian product n M e x u is the set of functions / with 
domain x such that fhc £u for all u £ x. Show that AC is equivalent to the 
proposition that the Cartesian product of any set x of non-empty sets is also 
non-empty. 

4.76 Show that AC implies that any partial ordering of a set x is included in 
a total ordering of x. 

4.77 Prove that the following is a consequence of AC: for any ordinal a, if x 
is a set such that x=^co E and such that (Vu)(u £ x => u^co a ), then (Jx=^co E . 
[Hint: The proof is like that of Proposition 4.43(c).] 

4.78 (a) Prove j;=^x =$■ (3/)(Fnc(/) A @(f) = x A — >>)• 

(b) Prove that AC implies the converse of part (a). 

4.79 D (a) Prove ( u + c v) 2 = u 2 -f c (2 x (w x r)) + c v 2 . 

(b) Assume y is a well-ordered set such that x x y = x + c y and 
->(y=^x). Prove that x=^y. 

(c) Assume y = y x y for all infinite sets y. Prove that, if Inf(x) and 
z = then x x z = x X c z. 

(d) Prove that AC is equivalent to (Vy)(Inf(y) y = j;x y) (Tarski, 
1923). 

A stronger form of the axiom of choice is the following sentence (UCF): 
(ELY) (Fnc(X) A (Vu)(u f 0 =>• X'u £ u)). (There is a universal choice function 
- that is, a function that assigns to every non-empty set u an element of u.) 
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UCF obviously implies AC, but W.B. Easton proved in 1964 that UCF is 
not provable from AC if NBG is consistent. However, Feigner (1971b) 
proved that, for any sentence 3ft in which all quantifiers are restricted to sets, 
jf 3ft is provable from NBG + (UCF), then 3ft is provable in NBG + (AC). 
(See Feigner (1976) for a thorough treatment of the relations between UCF 
and AC.) 

The theory of cardinal numbers can be simplified if we assume AC; for 
A C implies that every set is equinumerous with some ordinal and, therefore, 
that every set x is equinumerous with a unique initial ordinal, which can be 
designated as the cardinal number of x. Thus, the cardinal numbers would be 
identified with the initial ordinals. To conform with the standard notation 
for ordinals, we let stand for co H . Proposition 4.40 and Corollary 4.41 
establish some of the basic properties of addition and multiplication of 
cardinal numbers. 

The status of the axiom of choice has become less controversial in recent 
years. To most mathematicians it seems quite plausible, and it has so many 
important applications in practically all branches of mathematics that not to 
accept it would seem to be a wilful hobbling of the practising mathemati- 
cian. We shall discuss its consistency and independence later in this section. 

Another hypothesis that has been proposed as a basic principle of set 
theory is the so-called regularity axiom (Reg): 


(VAf)(X / 0 (3y)(y EX AyHX — 0)) 

(Every non-empty class X contains a member that is disjoint from X.) 


PROPOSITION 4.44 

(a) The regularity axiom implies the Fundierungsaxiom : 

->(3/)Fnc(/) A ®{f) = to A (Vw)(« € co =► f l (u') £ fu)) 

that is, there is no infinitely descending 6-sequence xq 3 x\ 3 xj 3 . . . 

(b) If we assume AC, then the Fundierungsaxiom implies the regularity 
axiom. 

(c) The regularity axiom implies the non-existence of finite 6-cycles that 
is, of functions / on a non-zero finite ordinal a such that 
/‘0 6 /‘I 6 ... 6 /‘a 6 /‘0. In particular, it implies that there is no set y 
such that y 6 y. 


Proof 

(a) Assume Fnc(/) A @{f) — co A (\/u)(u 6 co =>- f l {u') 6 fu). Let z = /“co. 
By (Reg), there is some element y in z such that y fl z = 0. Since y 6 z, there 
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is a finite ordinal a such that y = /‘a. Then /‘(a') eyHz, contradicting 
n z = 0. 

(b) First, we define the transitive closure TC («) of a set u. Intuitively, We 

want TC(m) to be the smallest transitive set that contains u. Define by- 
induction a function g on w such that g% = {«} and g l (a') — (J(c/‘a) for each! 
a in co. Thus, #‘l = u, g l 2 — (J«,0‘3 = (J(U W )» an( i so on. Let 
TC(m) = (J(<7“cu) be called the transitive closure of u. For any n, TC(u) is 5 
transitive; that is, (Vv)(v G TC (u) =^dC TC(m)). Now, assume AC and the 
Fundierungsaxiom; also, assume T/0 but there is no y in X such that 
ynX=0. Let b be some element of X ; hence, b nX / 0. Let 
c = TC (b) (IX. By AC, let h be a choice function for c. Define a function/ 
on co such that /‘0 = b and, for any a in co,/‘(a') = fl X). It follows 

easily that, for each a in co,/‘(oc') 6 /‘a, contradicting the Fundierungsaxi- 
om. (The proof can be summarized as follows: we start with an element b of 
X\ then, using h, we pick an element / T in br\X; since, by assumption, f‘\ 
and X cannot be disjoint, we pick an element /‘2 in f T DX, and so on.) 

(c) Assume given a finite 6-cycle: f% € /‘I 6 ... 6 f l n 6 /‘0. Let X be 
the range of /: {/‘0,/T, . . . ,/‘m}. By (Reg), there is some f l j vaX such that 
f\j n X — 0. But each element of X has an element in common with XK 


Exercises 

4.80 If z is a transitive set such that u 6 z, prove that TC(«) C z. 

4.81 By the principle of dependent choices (PDC) we mean the following: if r 
is a non-empty relation whose range is a subset of its domain, then there is a 
function f:w — »• @(r) such that (Vw)(c/ 6 co => (/‘n,/‘(z/)) 6 r) (Mostowski, 
1948). 

(a) Prove h AC PDC. 

(b) Show that PDC implies the denumerable axiom of choice (DAC): 
Den(x) A (V«)(w 6r=>-i/0)=^ (3/)(/ : x — ■* [Jx A (Vz/)(z/ 6 x => /‘zz 6 i/)) 

(c) Prove h PDC =>- (Vx)(Inf(x) =>- co=^x) (Hence, by Exercise 4.64(c), PDC 
implies that a set is infinite if and only if it is Dedekind-infinite.) 

(d) Prove that the conjunction of PDC and the Fundierungsaxiom implies 

(Reg). 

Let us define by transfinite induction the following function T with do- 
main On: 


*The use of AC in deriving (Reg) from the Fundierungsaxiom is necessary. 
Mendelson (1958) proved that, if NBG is consistent and if we add the Fundie- 
rungsaxiom as an axiom, then (Reg) is not provable in this enlarged theory. 
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Let // stand for (J(T“0 ?j), that is, H consists of all members of sets of the 
form 'Fa. Let H p stand for Thus, H p = ^ l p) and Hp = ^(T‘ (/?')) 

= ^(Hf)). In particular, H$ — ^(T‘0) = ^(0) — {0}, H) = ^(// 0 ) = 

0>m) = (0> {0}}, and H 2 = n^i) = ^({0, {0}}) - {0, {0}, {{0}}, 

{ 0 ,{ 0 }}} 

Define a function p on H such that, for any x in H, p l x is the least ordinal 
a such that x 6 T‘a. p l x is called the rank of x. Observe that p‘x must be a 
successor ordinal. (In fact, there are no sets of rank 0, since *F0 = 0. If ). is a 
limit ordinal, every set in T‘2 already was a member of T‘/i for some 
^ <o 1) As examples, note that p% — l,p‘{0} = 1, p‘{0, {0}} = 2, and 
= 2 - 


Exercise 4.82. Prove that the following are theorems of NBG. 


(a) (Va)Trans('Fa) 

(b) Trans (H) 

(c) (Va)0FaCT‘(a')) 

(d) (Va)(V/?)(a < D p ^ 'Fa C p ) 

(e) On CH 

(f) (Va)(p‘a = a') 

(g) (V«) (Vr) {u gHAvgH A uGv=5- p‘u < 0 p l v) 

(h) (V?/)(« CH^ueH) 


PROPOSITION 4.45 

The regularity axiom is equivalent to the assertion that V = H, that is, that 
every set is a member of H . 


Proof 

(a) Assume V ~ H. Let X ^ 0. Let a be the least of the ranks of all the 
members of X, and let b be an element of X such that p l b = a. Then 
bnX = 0; for, if u E b fl X, then, by Exercise 4.82(g), p‘w G p l b = a, 
contradicting the minimality of a. 

(b) Assume (Reg). Assume V -/ H. Then V — H ^ 0. By (Reg), there is 
some y in V — H such that y fl (V — H) =0. Hence, y QH and so, by 
Exercise 4.82(h), y GH, contradicting y G V — H. 
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4.83 Show that (Reg) is equivalent to the special case: (Vx)(x f 0 
=>- (3j)(r 6 x A y n x = 0)). 

4.84 Show that, if we assume (Reg), then Ord(2f) is equivalent to 
Trans (Y) A E Con X, that is, to the wf 

(Vu){u £ X u C X) A (Vw)(Vti)(i/ GA'AdG JAii^ uViiGh) 

Thus, with the regularity axiom, a much simpler definition of the notion of 
ordinal class is available, a definition in which all quantifiers are restricted to 
sets. 

4.85 Show that (Reg) implies that every non-empty transitive class con- 
tains 0 

Proposition 4.45 certainly increases the attractiveness of adding (Reg) as 
a new axiom to NBG. The proposition V = H asserts that every set can be 
obtained by starting with 0 and applying the power set and union operations 
any transfinite number of times. The assumption that this is so is called the 
iterative conception of set. Many set theorists now regard this conception as 
the best available formalization of our intuitive picture of the universe of 
sets.* 

By Exercise 4.84, the regularity axiom would also simplify the definition 
of ordinal numbers. In addition, we can develop the theory of cardinal 
numbers on the basis of the regularity axiom; namely, just define the car- 
dinal number of a set x to be the set of all those y of lowest rank such that 
y = x. This would satisfy the basic requirement of a theory of cardinal 
numbers, the existence of a function Card whose domain is V and such that 
(Vx)(Vj)(Card‘x = Card‘s yy x = y). 

There is no unanimity among mathematicians about whether we have 
sufficient grounds for adding (Reg) as a new axiom, for, although it has 
great simplifying power, it does not have the immediate plausibility that 
even the axiom of choice has, nor has it had any mathematical applications. 
Nevertheless, it is now often taken without explicit mention to be one of the 
axioms. 

The class H determines an inner tnodel of NBG in the following sense. 
For any wf (written in unabbreviated notation), let Rel//($?) be the wf 
obtained from Oft by replacing every subformula (ffX)^{X) by 
(VX) (X QH =y &(X)) (in making the replacements we start with the in- 


^The iterative conception seems to presuppose that we understand the power set 
and union operations and that ordinal numbers (or something essentially equivalent 
to them) are available for carrying out the transfinite iteration of the power set and 
union operations. 



» • • 




nermost subformulas) and then, if ^ contains free variables. Fi, 



prefixing (Fj C H A F 2 C // a . . . A Y„ C ff) =k 

In other words, in forming Rel//(,^), we interpret ‘’class’ as ‘subclass of 
H\ Since M(Af) stands for (3F)(X € F), Rel 7/ (M(A0) is 
(3F)(F C H A X 6 F), which is equivalent to X e H; thus, the ‘sets’ of the 
model are the elements of H. Hence, Rel//((Vx)^) is equivalent to 
(Vx) (x e H =>• where 38^ is Rel/y (^?) . Note also that 

\- X Q H l\Y C H =>- [Rel/f(X = F) o X = Y]. Then it turns out that, for 
any theorem 'ffl of NBG, Rel//(,^) is also a theorem of NBG. 


Exercises 

4.86 Verify that, for each axiom OS of NBG, Fnbg Rel^(^). If we adopt a 
semantic approach, one need only show that, if Jl is a model for NBG, in 
the usual sense of ‘model’, then the objects X of Jl that satisfy the wf X C H 
also form a model for NBG. In addition, one can verify that (Reg) holds in 
this model; this is essentially just part (a) of Proposition 4.45. A direct 
consequence of this fact is that, if NBG is consistent, then so is the theory 
obtained by adding (Reg) as a new axiom. That (Reg) is independent of 
NBG (that is, cannot be proved in NBG) can be shown by means of a model 
that is somewhat more complex than the one given above for the consistency 
proof (see Bernays, 1937-1954, part VII). Thus, we can consistently add 
either (Reg) or its negation to NBG, if NBG is consistent. Practically the 
same arguments show the independence and consistency of (Reg) with re- 
spect to NBG + (AC). 

4.87 Consider the model whose domain is H a and whose interpretation of 6 
is E fj at the membership relation restricted to H a . Notice that the ‘sets’ of this 
model are the sets of rank ^ D a and the ‘proper classes’ are the sets of rank 
a'. Show that the model H K satisfies all axioms of NBG (except possibly the 
axioms of infinity and replacement) if and only if Lim(a). Prove also that H a 
satisfies the axiom of infinity if and only if a > 0 co. 

4.88 Show that the axiom of infinity is not provable from the other axioms 
of NBG, if the latter form a consistent theory. 

4.89 Show that the replacement axiom (R) is not provable from the other 
axioms (T, P, N, (Bl) (B7), U, W, S) if these latter form a consistent theory. 

4.90 An ordinal a such that H a is a model for NBG is called inaccessible. 
Since NBG has only a finite number of proper axioms, the assertion that a is 
inaccessible can be expressed by the conjunction of the relativization to H a 
of the proper axioms of NBG. Show that the existence of inaccessible or- 
dinals is not provable in NBG if NBG is consistent. (Compare Shepherdson 
(1951-53), Montague and Vaught (1959), and, for related results, Bernays 
(1961) and Levy (I960).) Inaccessible ordinals have been shown to have 
connections with problems in measure theory and algebra (see Ulam, 1930; 
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Zeeman, 1955; Erdos and Tarski, 1961).+ The consistency of the theory 
obtained from NBG by adding an axiom asserting the existence of an in- 
accessible ordinal is still an open question. More about inaccessible ordinals 
may be found in Exercise 4.91. 


The axiom of choice turns out to be consistent and independent with 
respect to the theory NBG + (Reg). More precisely, if NBG is consistent 
AC is an undecidable sentence of the theory NBG + (Reg). In fact, Godel 
(1938; 1939; 1940) showed that, if NBG is consistent, then the theory NBG 
+ (AC) + (Reg) + (GCH) is also consistent, where (GCH) stands for the 
generalized continuum hypothesis '. : 


(Vx)(Inf(x) =*- ->(3y)(x &{x))) 


(Our statement of Godel’s result is a bit redundant, since 
Hnbg (GCH) (AC) has been proved by Sierpinski (1947) and Specker 
(1954). This result will be proved below.) The unprovability of AC from 
NBG + (Reg), if NBG is consistent, has been proved by P.J. Cohen (1963— 
64), who also has shown the independence of the special continuum hy- 
pothesis , 2 C} = CO], in the theory NBG + (AC) + (Reg). Expositions of the 
work of Cohen and its further development can be found in Cohen (1966) 
and Shoenfield (1971b), as well as in Rosser (1969) and Feigner (1971a). For 
a thorough treatment of these results and other independence proofs in set 
theory, Jech (1978) and Kunen (1980) should be consulted. 

We shall present here a modified form of the proof in Cohen (1966) of 
Sierpinski’s theorem that GCH implies AC. 


DEFINITION 

For any set v, let & 0 (v) = v ) ^ >1 (v) = ^(d),^ 2 (d) = v )), . . . ,& >k+1 (v) 
= &>(0*(v)) for all km co. 


LEMMA 4.46 

If co^v, then £P k (v) + c k (v ) — ^(v) for all k^ 0 1. 


+ Inaccessible ordinals are involved also with attempts to provide a suitable set- 
theoretic foundation for category theory (see MacLane, 1971; Gabriel, 1962; Sonner, 
1962; Kruse, 1966; Isbell, 1966). 
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Remember that x ) = 2 X (see Exercise 4.40). From co=^r we obtain 
cq^^(v) for all k in co. Hence, #*(*;) + c l = ^ k (v) for all k in co, by Ex- 
ercise 4.64(g). Now, for any k > c 1, 

&*(v) + c v ) = 0*{v) x2 = ^(^*"‘( v )) x 2 =* 2^" 1(B) x 2 
^ 2^ 1(B) x 2' £* 2 ! ^" 1 ( 0 +ci ^ (») ^ #(#■ •(„)) = op k ( v ) 

LEMMA 4.47 

If y + c x = ^(x + c x), then ^(x) =^y. 


Proof 

Notice that 3P{x + c x) = 2 x+cX = 2 X x 2 X == 3P (x) x £P(x). Let y* — y x {0} 
and x* = x x {1}. Since y + c x = £P{x + c x) = 3P{x) x ^(x), there is a func- 
tion / such that y* U x*^^(x) x 0*(x). Let h be the function that takes each u 
in x* into the first component of the pair f‘u. Thus, h: x* =>- 8P(x). By 
Proposition 4.25(a), there must exist c in 0*(x) — /7 <l (x*). Then, for all z in 
0>{x), there exists a unique v in y* such that /‘u = (c,z). This determines a 
one-one function from ^(x) into y. Hence, 3?(x) =^y. 


PROPOSITION 4.48 
Assume GCH. 

(a) If u cannot be well-ordered and u + c u = u and p is an ordinal such that 
/?=^2", then P^u. 

(b) The axiom of choice holds. 

Proof 

(a) Notice that u + c u = u implies 1 + c u = u, by Exercise 4.71(b). Therefore, 
by Exercise 4.55(i), 2" + c u = 2 U . Now, u^p T c 2"+ c u = 2". By GCH, 
either (i) u = p+ c u or (ii) p+ c u = 2 U . If (ii) holds, 
P + c u = 2 U + c u = &{u + c u )' Hence, by Lemma 4.47, £^(^)=^/? and, 
therefore, u^p. Then, since u would be equinumerous with a subset of an 
ordinal, u could be well-ordered, contradicting our assumption. Hence, (i) 
must hold. But then, P^P + c u = u. 

(b) We shall prove AC by proving the equivalent sentence (WO) asserting 
that every set can be well-ordered. To that end, consider any set x and 
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assume, for the sake of contradiction, that x cannot be well-ordered. Let 
v = 2 xU0) . Then Hence, by Lemma 4.46, 0* (v) + c f v \ 

= 0> *(u ) for all k^ 0 1- Also, since x^x U co^v -< ^(v) -< -< . 

and x cannot be well-ordered, each 3^{v) cannot be well-ordered, for k ^ o 
Let ft = Jf ‘v. We know that by Corollary 4.32. Hence, by part (a), 

with u = 0* 3 (v), we obtain Using part (a) twice more (successively 

with u = & 2 {v ) and u — v )), we obtain = P^v. But this contradicts 
the definition of as the least ordinal not equinumerous with a subset 
of v. 


Exercise 

4.91 An a-sequence is defined to be a function / whose domain is a. If the 
range of / consists of ordinals, then / is called an ordinal a-sequence and, if 
in addition, ft < 0 y < 0 a implies /‘/? < 0 //, then / is called an 
increasing ordinal a-sequence. By Proposition 4.12, if / is an increasing 
ordinal a-sequence, then (J(/“a) is the least upper bound of the range of/. 
An ordinal 5 is said to be regular if, for any increasing ordinal a-sequence 
such that a < 0 <5 and the ordinals in the range of / are all 
<o U(T«) +o 1 A Non-regular ordinals are called singular ordinals. 

(a) Which finite ordinals are regular? 

(b) Show that ojq is regular and co^ is singular 

(c) Prove that every regular ordinal is an initial ordinal. 

(d) Assuming the axiom of choice (AC), prove that every ordinal of the 
form co v+0 i is regular. 

(e) If <D a is regular and Lim(a), prove that co a = a. (A regular ordinal co a 
such that Lim(a) is called a weakly inaccessible ordinal.) 

(f) Show that, if co K has the property that y < Q co a implies &(y) -< co tt , then 
Lim(a). The converse is implied by the generalized continuum hypoth- 
esis. A regular ordinal such that a > G 0 and such that y < 0 co K implies 
£P(y) -< (D a , is called strongly inaccessible. Thus, every strongly inacces- 
sible ordinal is weakly inaccessible and, if (GCH) holds, the strongly 
inaccessible ordinals coincide with the weakly inaccessible ordinals. 

(g) (Sheperdson 1951-53; Montague and Vaught, 1959) (i) If y is inacces- 
sible (i.e., if H y is a model of NBG), prove that y is weakly inaccessible. 
(ii) D In the theory NBG + (AC), show that y is inaccessible if and only 
if y is strongly inaccessible. 

(h) If NBG is consistent, then in the theory NBG + (AC) + (GCH), show 
that it is impossible to prove the existence of weakly inaccessible or- 
dinals. 
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4.6 OTHER AXIOMATIZATIONS OF SET THEORY 


We have chosen to develop set theory on the basis of NBG because it is 
relatively simple and convenient for the practising mathematician. There 
are, of course, many other varieties of axiomatic set theory, of which we will 
now make a brief survey. 


Morse-Kelley (MK) 

Strengthening NBG, we can replace axioms (B1)-(B7) by the axiom schema: 

( □) (3F) (V.y) (xeY& 0S{x)) 

where 0ft {pc) is any wf (not necessarily predicative) of NBG and Y is not free 
in 0ft (x). The new theory MK, called Morse-Kelley set theory , became well- 
known through its appearance as an appendix in a book on general to- 
pology by Kelley (1955). The basic idea was proposed independently by 
Mostowski, Quine, and Morse (whose rather unorthodox system may be 
found in Morse (1965)). Axioms (B1)-(B7) follow easily from (□) and, 
therefore, NBG is a subtheory of MK. Mostowski (1951a) showed that, if 
NBG is consistent, then MK is really stronger than NBG. He did this by 
constructing a ‘truth definition’ in MK on the basis of which he proved 
I— mk ^WnbG) where ^^nbg is a standard arithmetic sentence asserting the 
consistency of NBG. On the other hand, by Godel’s second theorem, 
^^nbg is not provable in NBG if the latter is consistent. 

The simplicity and power of schema (□) make MK very suitable for use 
by mathematicians who are not interested in the subtleties of axiomatic set 
theory. But this very strength makes the consistency of MK a riskier gamble. 
However, if we add to NBG + (AC) the axiom (In) asserting the existence 
of a strongly inaccessible ordinal 6, then Ho is a model of MK. Hence, MK 
involves no more risk than NBG + (AC) + (In). 

There are several textbooks that develop axiomatic set theory on the basis 
of MK (Rubin, 1967; Monk, 1980; Chuquai, 1981). Some of Cohen’s in- 
dependence results have been extended to MK by Chuquai (1972). 


Exercises 

4.92 Prove that axioms (Bl) (B7) are theorems of MK. 

4.93 Verify that, if 6 is a strongly inaccessible ordinal, then Ho is a model of 
MK. 
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The earliest axiom system for set theory was devised by Zermelo (1908). The 
objects of the theory are thought of intuitively as sets, not the classes of 
NBG or MK. Zermelo ’s theory Z can be formulated in a language that 
contains only one predicate letter e. Equality is defined extensionally: x = y 
stands for (Vz)(z 6x^z6 y). The proper axioms are: 


T: x — y => (x e z o y G z) 

P: (3z)(Vz/)(m 6 z «=y u = x V u = y) 

N: (3x)(Vy)0£x) 

U: (3y)(Vw)(« (3t>)(w GnAuG x)) 

W: (3y)(yu)(u e y <=> u C x) 


( substitutivity of =) 

( pairing ) 
{null set ) 
{sum set) 
{power set) 


\ / \ / \ - — ' u 

S *: (3y)(V«)(w ey {u £ xA ${u))), where 38{u) is any wf not containing; 

free ( selection ) 

I: (3x)(0 € X A (Vz) (z e X => z U {z} ex)) ( infinity ) 


Here we have assumed the same definitions of C,0, u and {«} as in 
NBG. 

Zermelo’s intention was to build up mathematics by starting with a few 
simple sets (0 and co) and then constructing further sets by various well- 
defined operations (such as formation of pairs, unions and power sets). In 
fact, a good deal of mathematics can be built up within Z. However, 
Fraenkel (1922a) observed that Z was too weak for a full development of 
mathematics. For example, for each finite ordinal n, the ordinal co + 0 n can 
be shown to exist, but the set A of all such ordinals cannot be proved to 
exist, and, therefore, co -f 0 co, the least upper bound of A, cannot be shown 
to exist. Fraenkel proposed a way of overcoming such difficulties, but his 
idea could not be clearly expressed in the language of Z. However, Skolem 
(1923) was able to recast Fraenkel’ s idea in the following way: for any wf 
88{x,y), let Fun(^) stand for (Vx) (Vn) (Vr) {&{x, u) A &{x, v) =>- u = v). Thus, 
Fun {(ffl) asserts that (A determines a function. Skolem’s axiom schema of 
replacement can then be formulated as follows: 

(R # ) Fun(^) =>• (Vw)(3z)(Vr)(r £z« (3w)(« e w /\2${u,v))) 

for any wf &{x,y) 

This is the best approximation that can be found for the replacement axiom 
R of NBG. 

The system Z + {R^) is denoted ZF and is called Zermelo-Fraenkel set 
theory. In recent years, ZF is often assumed to contain a set-theoretic reg- 
ularity axiom (Reg*): x f 0 =£- (3y) (y 6 x Ay fix = 0). The reader should 
always check to see whether (Reg*) is included within ZF. ZF is now the 
most popular form of axiomatic set theory; most of the modern research in 
set theory on independence and consistency proofs has been carried out with 
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respect to ZF. For expositions of ZF, see Krivine (1971), Suppes (1960), 
Zuckerman (1974), Levy (1978) and Hrbacek and Jech (1978). 

ZF and NBG yield essentially equivalent developments of set theory. 
Every sentence of ZF is an abbreviation of a sentence of NBG since, in 
NBG, lower-case variables x,y,z, ... serve as restricted set variables. Thus 
axiom N is an abbreviation of (3x)(M(x) A (Vy)(M(y) => y£x)) in NBG. It 
is a simple matter to verify that all axioms of ZF are theorems in NBG. 
Indeed, NBG was originally constructed so that this would be the case. We 
can conclude that, if NBG is consistent, then so is ZF. In fact, if a con- 
tradiction could be derived in ZF, the same proof would yield a con- 
tradiction in NBG. 

The presence of class variables in NBG seems to make it much more 
powerful than ZF. At any rate, it is possible to express propositions in NBG 
that either are impossible to formulate in ZF (such as the universal choice 
axiom) or are much more unwieldy in ZF (such as transfinite induction 
theorems). Nevertheless, it is a surprising fact that NBG is no riskier than 
ZF. An even stronger result can be proved: NBG is a conservative extension 
of ZF in the sense that, for any sentence $ of the language of ZF, if Fnbg 
then hzF $ (see Novak (Gal) 1951; Rosser and Wang, 1950; Shoenfield, 
1954). This implies that, if ZF is consistent, then NBG is consistent. Thus, 
NBG is consistent if and only if ZF is consistent, and NBG seems to be no 
stronger than ZF. However, NBG and ZF do differ with respect to the 
existence of certain kinds of models (see Montague and Vaught, 1959). 
Moreover, another important difference is that NBG is finitely axiomatiz- 
able, whereas Montague (1961a) showed that ZF (as well as Z) is not finitely 
axiomatizable. Montague (1961b) proved the stronger result that ZF cannot 
be obtained by adding a finite number of axioms to Z. 


Exercise 

4.94 Let H* = (J H a (see page 281). 

(a) Verify that H* consists of all sets of rank less than a. 

(b> if a is a limit ordinal > G co, show that H* is a model for Z. 

(c) D Find an instance of the axiom schema of replacement (R#) that is false 

in H* i+o0r [Hint: Let t%(x,y) be x € co A y=co+ Q x. Observe that 
co + 0 co£H* + o<0 and co + c co = (J{r | (3 u)(u e co A .%(u, r))}.] 

(d) Show that, if ZF is consistent, then ZF is a proper extension of Z. 


The theory of types (ST) 

Russell’s paradox is based on the set K of all those sets that are not members 
of themselves: K = (x | x£x\. Clearly, K e K if and only if K<£K. In NBG 
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this argument simply shows that AT is a proper class, not a set. In ZF the 
conclusion is just that there is no such set K. 

Russell himself chose to find the source of his paradox elsewhere. He 
maintained that xex and xfix should be considered ‘illegitimate’ and 
‘ungrammatical’ formulas and, therefore, that the definition of K makes no 
sense. However, this alone is not adequate because paradoxes analogous to 
Russell’s can be obtained from slightly more complicated circular prop- 
erties, like x e y A y e x. 


Exercise 

4.95 (a) Derive a Russell-style paradox by using x G y A y e x. 

(b) Use x G j’i A yi G yj A ... A y„-\ G y„ A y„ ex to obtain a paradox, 
where n > 1 . 

Thus, to avoid paradoxes, one must forbid any kind of indirect circu- 
larity. For this purpose, we can think of the universe divided up into types in 
the following way. Start with a collection W of non-sets or individuals. The 
elements of W are said to have type 0. Sets whose members are of type 0 are 
the objects of type 1. Sets whose members are of typel will be the objects of 
type 2, and so on. 

Our language will have variables of different types. The superscript of a 
variable will indicate its type. Thus, x° is a variable of type 0, y 1 is a variable of 
type 1, and so on. There are no variables other than type variables. The atomic 
wfs are of the formje" G y” +1 , where n is one of the natural numbers 0, 1 , 2, . . . . 
The rest of the wfs are built up from the atomic wfs by means of logical 
connectives and quantifiers. Observe that -i(x G x) and -i(x,G y Ay ex) are 
not wfs. 

The equality relation must be defined piecemeal, one definition for each 
type. 


DEFINITION 

x n — /' for (Vz' H 1 )(;i n G z" +1 f Gz" +1 ) Notice that two objects are de- 
fined to be equal if they belong to the same sets of the next higher type. The 
basic property of equality is provided by the following axiom scheme. 


ST1 (EXTENSIONALITY AXIOM) 

(Vx")(x" g/ +i o/cz' ,+l )^y HI =z" +1 

This asserts that two sets that have the same members must be equal. On the 
other hand, observe that the property of having the same members could 
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not be taken as a general definition of equality because it is not suitable for 
objects of type 0. 

Given any wf $(x n ), we wish to be able to define a set |x" | ^?(x")}. 


ST2 (COMPREHENSION AXIOM SCHEME) 

For any wf &(x n ), the following wf is an axiom: 

(3/ +, )(Va J7 )(x" c y +l o &{?)) 

Here, y' H 1 is any variable not free in ?A{x"). If we use the extensionality 
axiom, then the set y” +1 asserted to exist by axiom ST2 is unique and can be 
denoted by {x" | ${x n ) } . 

Within this system, we can define the usual set-theoretic notions and 
operations, as well as the natural numbers, ordinal numbers, cardinal 
numbers and so on. However, these concepts are not unique but are re- 
peated for each type (or, in some cases, for all but the first few types). For 
example, the comprehension scheme provides a null set A' Hl = 
[x J1 | x" 7 ^ a"} for each non-zero type. But there is no null set per se. The 
same thing happens for natural numbers, n type theory, the natural numbers 
are not defined as they are in NBG. Here they are the finite cardinal 
numbers. For example, the set of natural numbers of type 2 is the inter- 
section of all sets containing {A 1 } and closed under the following successor 
operation: the successor S(y 2 ) of a set y 2 is {r 1 | (3w 1 )(3z°) (w 1 e 
y 2 Az°^[/ A d 1 = w 1 U {z 0 })}. Then, among the natural numbers of type 2, 
we have 0 = {A 1 }, 1 = S(0), 2 = S(l), and so on. Here, the numerals 0, 1,2, 
. . . should really have a superscript 2 to indicate their type, but the super- 
scripts were omitted for the sake of legibility. Note that 0 is the set of all sets 
of type 1 that contain no elements, 1 is the set of all sets of type 1 that 
contain one element, 2 is the set of all sets of type 1 that contain two 
elements, and so on. 

This repetition of the same notion in different types makes it somewhat 
inconvenient for mathematicians to work within a type theory. Moreover, it 
is easy to show that the existence of an infinite set cannot be proved from the 
extensionality and comprehension schemas.^ To see this, consider the 
‘model’ in which each variable of type n ranges over the sets of rank less 
than or equal to n + G 1 . (There is nothing wrong about assigning overlap- 
ping ranges to variables of different types.) 

We shall assume an axiom that guarantees the existence of an infinite set. 
As a preliminary, we shall adopt the usual definition { {a" } , {a" , y” } } of the 
ordered pair: (x",y”), where {a" , y" } stands for (w" | u 11 = a" V t/ 1 = y”}. 

I This fact seemed to undermine Russell’s doctrine of logicism, according to 

which all of mathematics could be reduced to basic axioms that were of an essentially 
logical character. An axiom of infinity could not be thought of as a logical truth. 
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Notice that (x w ,y ^ ) is of type n + 2. Hence, a binary relation on a set A, being 
a set of ordered pairs of elements of A , will have type 2 greater than the type 
of A. In particular, a binary relation on the universe V 1 = {jc° | x 0 = *9) 
all objects of type 0 will be a set of type 3. 


ST3 (AXIOM OF INFINITY) 

(3x 3 )([(3i/ , )Pi/ , )«i/ , ,i/ , > eJ)]A 

(V«°)(Vr 0 )(Vw 0 )({i/ 0 ,i/ 0 ) ^ x 3 A lP) ex 3 A (v°, tv°) Gx 3 ^ 
ex 3 ] A [{m°, t>°) Gx 3 ^ (3z°)({r°,z°) Gx 3 )])) 

This asserts that there is a non-empty irreflexive, transitive binary relation x 3 
on V 1 such that every member of the range of x 3 also belongs to the domain 
of x 3 . Since no such relation exists on a finite set, V x must be infinite. 

The system based on ST1 ST3 is called the simple theory of types and is 
denoted ST. Because of its somewhat complex notation and the repetition of 
concepts at all (or, in some cases, almost all) type levels, ST is not generally 
used as a foundation of mathematics and is not the subject of much con- 
temporary research. Suggestions by Turing (1948) to make type theory more 
usable have been largely ignored. 

With ST we can associate a first-order theory ST*. The non-logical 
constants of ST* are e and monadic predicates T„ for each natural number 
n. We then translate any wf of ST into ST* by replacing subformulas 
(Vx")#(x") by (\/x)(T„(x) => ^(x")) and, finally, if y j \ . . . ,y jk are the free 
variables of prefixing to the result 7), (yi ) A ... A ?)*(>&) => and changing 
each yj‘ into y,-. In a rigorous presentation, we would have to specify clearly 
that the replacements are made by proceeding from smaller to larger sub- 
formulas and that the variables x,y \, . . . ,y^ are new variables. The axioms of 
ST* are the translations of the axioms of ST. Any theorem of ST translates 
into a theorem of ST*. 


Exercise 

4.96 Exhibit a model of ST* within NBG. 

By virtue of Exercise 4.96, NBG (or ZF) is stronger than ST: (1) any 
theorem of ST can be translated into a corresponding theorem of NBG; and 
(2) if NBG is consistent, so is ST.t 

To provide a type theory that is easier to work with, one can add axioms 
that impose additional structure on the set V 1 of objects of type 0. For 

I A stronger result was proved by John Kemeny (1949) by means of a truth 
definition within Z: if Z is consistent, so is ST. 
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example, Peano’s axioms for the natural numbers were adopted at level 0 in 
GodeFs system P, for which he originally proved his famous incompleteness 
theorem (see Godel, 1931). 

In Principia Mathematica (1910-1913), the three-volume work by Alfred 
North Whitehead and Bertrand Russell, there is a theory of types that is 
further complicated by an additional hierarchy of orders. This hierarchy was 
introduced so that the comprehension scheme could be suitably restricted in 
order not to generate an impredicatively defined set, that is, a set A defined by 
a formula in which some quantified variable ranges over a set that turns out 
to contain the set A itself. Along with the mathematician Henri Poincare, 
Whitehead and Russell believed impredicatively defined sets to be the root 
of all evil. However, such concepts are required in analysis (for example, in 
the proof that any non-empty set of real numbers that is bounded above has 
a least upper bound). Principia Mathematica had to add the so-called axiom 
of reclucibility to overcome the order restrictions imposed on the compre- 
hension scheme. The Whitehead-Russell system without the axiom of re- 
ducibility is called ramified type theory ; it is mathematically weak but is of 
interest to those who wish an extreme constructivist approach to mathe- 
matics. The axiom of reducibility vitiates the effect of the order hierarchy; 
therefore, it is much simpler to drop the notion of order and the axiom of 
reducibility. The result is the simple theory of types ST, which we have 
described above. 

In ST, the types are natural numbers. For a smoother presentation, some 
logicians allow a larger set of types, including types for relations and/or 
functions defined on objects taken from previously defined types. Such a 
system may be found in Church (1940). 

Principia Mathematica must be read critically; for example, it often 
overlooks the distinction between a formal theory and its metalanguage. 
The idea of a simple theory of types goes back to Ramsey (1925) and, 
independently, to Chwistek (1924—25). Discussions of type theory are found 
in Andrews (1986), Hatcher (1982) and Quine (1963). 


Quine’s theories NT and ML 

Quine (1937) invented a type theory that was designed to do away with some 
of the unpleasant aspects of type theory while keeping the essential idea of 
the comprehension axiom ST2. Quine’s theory NF (New Foundations) uses 
only one kind of variable x,y,z, ... and one binary predicate letter €. 
Equality is defined as in type theory: x — y stands for (Vz)(x Gz^ygz). 
The first axiom is familiar: 




NF1 (EXTENSIONALITY) 

(Vz)(z =» x = y 

In order to formulate the comprehension axiom, we introduce the notion 
of stratification. A wf $ is said to be stratified if one can assign integers to 
the variables of 2A so that: (1) all occurrences of the same free variable are 
assigned the same integer; (2) all bound occurrences of a variable that are 
bound by the same quantifier must be assigned the same integer; and (3) for 
every subformula x £ y of the integer assigned to y is 1 greater than the 
integer assigned to x. 

Examples 

1. (3y)(* Gy Ay G z) V u G x is stratified by virtue of the assignment indi- 
cated below by superscripts: 

(By 2 )^ 1 ey 2 Ay 2 e z 3 ) V iP e x l 

2. ((3y)(x Gy)) A (3y) (y G x) is stratified as follows: 

((3/K* 1 e/)) A (3/Vex 1 ) 

Notice that the ys in the second conjunct do not have to have the same 
integers assigned to them as the ys in the first conjunct. 

3. xGyVyGxis not stratified. If x is assigned an integer n, then the first y 
must be assigned n + 1 and the second y must be assigned n — 1, con- 
tradicting (1). 

NF2 (COMPREHENSION) 

For any stratified wf ^?(x), 

( 3y)(\/x)(x E y ty $(x)) 

is an axiom. (Here, y is assumed to be the first variable not free in 3?(x).) 

Although NF2 is an axiom scheme, it turns out that NF is finitely axi- 
omatizable (Hailperin, 1944). 

Exercise 

4.97 Prove that equality could have been defined as follows: x — y for 
(Vz)(x Gz^j’Gz) (More precisely, in the presence of NF2, this definition is 
equivalent to the original one.) 

The theory of natural numbers, ordinal numbers and cardinal numbers is 
developed in much the same way as in type theory, except that there is no 
longer a multiplicity of similar concepts. There is a unique empty set 
A = {x | x x} and a unique universal set V — {x | jc -= *}. We can easily 
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prove V E V, which immediately distinguishes NF from type theory (and 
from NBG, MIC and ZF). 

The usual argument for Russell’s paradox does not hold in NF, since 
X £ x is not stratified. Almost all of standard set theory and mathematics is 
derivable in NF; this is done in full detail in Rosser (1953). However, NF 
has some very strange properties. First of all, the usual proof of Cantor’s 
theorem, A -< &{. A ), does not go through in NF; at a key step in the proof, a 
set that is needed is not available because its defining condition is not 
stratified. The apparent unavailability of Cantor’s theorem has the desirable 
effect of undermining the usual proof of Cantor’s paradox. If we could 
prove A -< (A {A), then, since 0*(V) = V, we could obtain a contradiction 
from V -< 0t(V). In NF, the standard proof of Cantor’s theorem does yield 
USC(/4) -< A ), where USC(T) stands for {* | (3t/)(*/ eA Ax = {«})}. If 
we let A = V, we conclude that U SC ( V) -< V. Thus, V has the peculiar 
property that it is not equinumerous with the set of all unit sets of its 
elements. In NBG, the function f, defined by f(u) — {«} for all u in A, 
establishes a one-one correspondence between A and U SC (A ) for any set /l. 
However, the defining condition for / is not stratified, so that / may not 
exist in NF. If / does exist, A is said to be strongly Cantorian. 

Other surprising properties of NF are the following. 

1. The axiom of choice is disprovable in NF (Specker, 1953). 

2. Any model for NF must be non-standard in the sense that a well-ordering 
of the finite cardinals or of the ordinals of the model is not possible in the 
metalanguage (Rosser and Wang, 1950). 

3. The axiom of infinity is provable in NF (Specker, 1953). 

Although property 3 would ordinarily be thought of as a great advan- 
tage, the fact of the provability of an axiom of infinity appeared to many 
logicians to be too strong a result. If that can be proved, then probably 
anything can be proved, that is, NF is likely to be inconsistent. In addition, 
the disprovability of the axiom of choice seems to make NF a poor choice 
for practising mathematicians. However, if we restrict attention to so-called 
Cantorian sets, sets A for which A and USC(/4) are equinumerous, then it 
might be consistent to assume the axiom of choice for Cantorian sets and to 
do mathematics within the universe of Cantorian sets. 

NF has another attractive feature. A substantial part of category theory 
(see MacLane, 1971) can be developed in a straightforward way in NF, 
whereas this is not possible in ZF, NBG or MIC. Since category theory has 
become an important branch of mathematics, this is a distinct advantage for 
NF. 

If the system obtained from NF by assuming the existence of an inac- 
cessible ordinal is consistent, then ZF is consistent (see Orey, 1956a; Collins 
1955). If we add to NF the assumption of the existence of an infinite strongly 
Cantorian set, then Zermelo’s set theory Z is consistent (see Rosser, 1954). 
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The question of whether the consistency of ZF implies the consistency of NF 
is still open (as is the question of the reverse implication). 

Let ST - be the simple theory of types ST without the axiom of infinity 
Given any closed wf 38 of ST, let 38 + denote the result of adding l to the 
types of all variables in 38. Let SP denote the theory obtained from ST~ by 
adding as axioms the wfs 38 38 + for all closed wfs 38. Specker (1958; 1962) 

proved that NF is consistent if and only if SP is consistent. 

Let NFU denote the theory obtained from NF by restricting the ext- 
ensionality axiom to non-empty sets: 

NF1* (3u)(u G x) A (Vz)(z 6x^>z£y)=^x = y 


Jensen (1968 -69) proved that NFU is consistent if and only if ST is con- 
sistent, and the equiconsistency continues to hold when both theories are 
supplemented by the axiom of infinity or by axioms of infinity and choice. 

Discussions of NF may be found in Hatcher (1982) and Quine (1963). 
Forster (1983) gives a survey of more recent results. 

Quine also proposed a system ML that is formally related to NF in much 
the same way that MIC is related to ZF. The variables are capital italic 
letters X, 7, Z , . . . ; these variables are called class variables. We define 
M(X),X is a set) by (37) (X E T), and we introduce lower-case italic letters 
x,y,z,... as variables restricted to sets. Equality is defined as in NBG: X — Y 
for (VZ) (X G 1 Z E f). Then we introduce an axiom of equality: 

ML1: X = 7AXgZ=*7gZ 


There is an unrestricted comprehension axiom scheme: 

ML2: (37) (Vx) (x G 7 38{x ) ) 

where 38{x) is any wf of ML. Finally, we wish to introduce an axiom that has 
the same effect as the comprehension axiom scheme NF2: 

ML3: (Vy,) . . . (Vy„)(3z)(V*)(x Gz« 38{x)) 

where 38 (x) is any stratified wf whose free variables are . . . , y„ (n > 0) 
and whose quantifiers are set quantifiers. 

All theorems of NF are provable in ML. Hence, if ML is consistent, so is 
NF. The converse has been proved by Wang (1950). In fact, any closed wf of 
NF provable in ML is already provable in NF. 

ML has the same advantages over NF that MIC and NBG have over ZF: 
a greater ease and power of expression. Moreover, the natural numbers of 
ML behave much better than those of NF; the principle of mathematical 
induction can be proved in full generality in ML. 


t Quine uses the word ‘element’ instead of ‘set’. 
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get theory with urelements 

The theories NBG, MIC, ZF, NF and ML do not allow for objects that are 
not sets or classes. This is all well and good for mathematicians, since only 
sets or classes seem to be needed for dealing with mathematical concepts and 
problems. However, if set theory is to be a part of a more inclusive theory 
having to do with the natural or social sciences, we must permit reference to 
things like electrons, molecules, people, companies, etc., and to sets and 
classes that contain such things. Things that are not sets or classes are 
sometimes called urelements 3 We shall sketch a theory UR similar to NBG 
that allows for the existence of urelements.^ Like NBG, UR will have a finite 
number of axioms. 

The variables of UR will be the lower-case Latin boldface letters 
xj, X 2 , . . . . (As usual, let us use x, y, z, . . . to refer to arbitrary variables.) In 
addition to the binary predicate letter A\ there will be a monadic predicate 
letter A\. We abbreviate T^x, y) by x G y, -^(x, y) by x ^ y, and A\ (x) by 
Cls (x). (Read ‘Cls(x)’ as ‘x is a class’.) To bring our notation into line with 
that of NBG, we shall use capital Latin letters as restricted variables for 
classes. Thus, (\/X)&(X) stands for (Vx) (Cls (x) => &(x)), and (3X)&(X) 
stands for (3x) (Cls(x) A 38(x)). Let M(x) stand for Cls(x) A (3y(x £ y), and 
read ‘M(x)’ as ‘x is a set’. As in NBG, use lower-case Latin letters as 
restricted variables for sets. Thus, (Vx)^(x) stands for (Vx) (M(x) => £^(x)), 
and (3x)&(x) stands for (3x) (M(x) A^(x)). Let Pr(x) stand for Cls(x) A 
-M(x), and read ‘Pr(x)’ as ‘x is a proper class’. Introduce Ur(x) as an 
abbreviation for ->Cls(x), and read ‘Ur(x)’ as ‘x is an urelement’. Thus, the 
domain of any model for UR will be divided into two disjoint parts con- 
sisting of the classes and the urelements, and the classes are divided into sets 
and proper classes. Let El(x) stand for M(x) V Ur(x), and read ‘El(x)’ as x is 
an element’. In our intended interpretation, sets and urelements are the 
objects that are elements (i.e., members) of classes. 


t Quine’s earlier version of ML, published in 1940, was proved inconsistent by 
Rosser (1942). The present version is due to Wang (1950). 

t‘Ur’ is a German prefix meaning primitive, original or earliest. The words 
‘individual’ and ‘atom’ are sometimes used as synonyms for ‘urelement’. 

§ Zermelo’s 1908 axiomatization permitted urelements. Fraenkel was among the 
first to draw attention to the fact that urelements are not necessary for mathematical 
purposes (see Fraenkel, 1928, pp. 355f). Von Neumann’s (1925; 1928) axiom systems 
excluded urelements. 
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Exercise 

4.98 Prove: buR (Vx)(El(x) ->Pr(x)). 

We shall define equality in a different way for classes and urelements 

DEFINITION x = y is an abbreviation for: 

[Cls(x) A Cls(y) A (Vz)(z Gx«>z6y)]v [Ur(x) A Ur(y) A (Vz)(x 6 z <=*• y e 2 ji 

Exercise 

4.99 Prove :byR (Vx)(x = x). 

AXIOM UR1 

(Vx)(Ur(x) =» (Vy)(y £ x)] 

Thus, urelements have no members. 

Exercise 

4.100 Prove: buR (Vx)(Vy)(x G y =4> Cls(y) A El(x)). 


AXIOM UR2 

(VY)(VT)(VZ)(Y = rAleZ^fGZ) 


Exercise 

4.101 Show: 

(a) b UR (Vx)(Vy)(x = y => (Vz)(z G x o z G y)) 

(b) l- U R ( Vx) (Vy) (x = y=>(Vz)(xGz«ye z)) 

(c) buR (Vx)(Vy)(x = y => [Cls(x) & Cls(y)] A [Ur(x) Ur(y)]A 
M(x) M(y)]) 

(d) byR (Vx) (Vy) [x = y => (^(x, x) => ^(x, y))], where &(x, y) arises from 
^?(x,x) by replacing some, but not necessarily all, free occurrences ofx 
by y, with the proviso that y is free for x in ^(x, x). 

(e) UR is a first-order theory with equality (with respect to the given 
definition of equality). 




OTHER AXIOMATIZATIONS OF SET THEORY 



Xl'bM UR3 (NULL SET) 


(3x)(Vy)(y£x) 

This tell us that there is a set that has no members. Of course, all urelements 
Iso have no elements. 


Exercise 

4,102 Show: 

* (3jx)(Vy)(y ^ x). On the basis of this exercise we can introduce a new 
individual constant 0 satisfying the condition M(0) A (Vy)(y^0). 


AXIOM UR4 (PAIRING) 

(Vx) (Vy) (El(x) A El(y) => (3z)(Vu)(u E z <=> [u = xVu = y]) 

Exercise 

4.112 Prove: I-ur (Vx)(Vy)(3iz)([El(x) A El(y) A (Vu)(u Ez^[u = xVu 
- y]) V [(-iEl(x) V -iEl(y)) A z = 0]) 

On the basis of this exercise we can introduce the unordered pair notation 
jx> y}. When x and y are elements, {x, y} is the set that has x and y as its 
only members; when x or y is a proper class, {x, y) is arbitrarily chosen to be 
the empty set 0. As usual, the singleton notation {x} stands for {x,x}. 


DEFINITION (ORDERED PAIR) 

Let (x, y) stand for {{x}, {x, y}}. As in the proof of Proposition 4.3, one can 
.show that, for any elements x,y, u,v, (x,y) = (u,v) [x = u Ay = v]. Or- 
dered n-tuples can be defined as in NBG. 

The class existence axioms B 1— B7 of NBG have to be altered slightly by 
sometimes replacing universal quantification with respect to sets by uni- 
versal quantification with respect to elements. 


AXIOMS OF CLASS EXISTENCE 

(UR5) (ELY) ( Vu) (Vv) (El (u) A El(v) [{u, v) Gl^uGv]) 

(UR6) (VX)(VF)(3Z)(Vu)(u gZ^uEIAugL) 

(UR7) (VX) (3Z) (Vu) (El (u) [uG/^-u^ X}) 

(UR8) (VX)(3Z)(Vu)(El(u) ^(uGZ^ (3v)((u,v) Gl)) 
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(UR9) (VX) (3Z) (Vu) (Vv) (El(u) A El(v) ^ «u,v) EZ<^uEX)) 

(UR10) (VX) (3Z) (Vu) (Vv) (Vw) (El (u) A El(v) 

A El(w) => [(u, v, w ) GZ<^ (v, w,u) € X]) 

(URI1) (VX)(3Z)(Vu)(Vv)(Vw)(El(u) A El(v) A El(w) 

[(u, v, w) GZ«> (u, w, v) e X]) 

As in NBG, we can prove the existence of the intersection, complement 
and union of any classes, and the existence of the class V of all elements. Bui 
in UR we also need an axiom to ensure the existence of the class of all 
sets, or, equivalently, of the class V m of all urelements. 


AXIOM UR 12 


(5LT)(Vu)(ueX^Ur(u)) 

This yields the existence of V UT and implies the existence of F M , that is- 
(3X)(Vu)(u gI^M(u)). The class Vei of all elements is then the union 
Ku-Ufivr. Note that this axiom also yields (3X)(Vu)(El(u) =>- [u EX 
Cls(u)]), since I'm can be taken as the required class X. 

As in NBG, we can prove a general class existence theorem. 


Exercise 

4.104. Let <p(x t , . . . , x„, y t , . . . , y m ) be a formula in which quantification 
takes place only with respect to elements, that is, any subformula (Vu)^has 
the form (Vu)(El(u) => ( i). Then 

Eur (3Z)(Vxj) . . . (Vx„)(El(xj) A ... A El(x n ) => 

«xi, . . . ,x„) GZ-» fp{x\ , . . . , x„, y t , . . . , yj]). 

The sum set, power set, replacement and infinity axioms can be translated 
into UR. 


AXIOM UR13 


(Vx) (3y) (Vu) (u (3v)(u 6 v A v Ex)) 


AXIOM UR14 


(Vx) (3_y) (Vu) (uGj^uCx) 

where uCx stands for M(u) A M(x) A (Vv)(v E u => v E x). 
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(VT)(Vx)(Un(T) => (3y) (Vu) [uG)^ (3v)({v,u) G Y A v ex)]) 

where Un(z) stands for (Vxi)(Vx 2 )(Vx 3 )[E1(xi) A El(x 2 ) A El(x 3 ) =>- «xi,x 2 ) 

6 z A {xi, x 3 )z => x 2 = x 3 )] 


AXIOM UR16 


(3x)(0 G x A (Vu)( u Gj:^hU { u } G x)) 

From this point on, the standard development of set theory including the 
theory of ordinal numbers, can be imitated in UR. 


iPROPOSITION 4.49 
NBG is a subtheory of UR. 


Proof 

It is easy to verify that every axiom of NBG is provable in UR, provided 
that we take the variables of NBG as restricted variables for ‘classes’ in UR. 
The restricted variables for sets in NBG become restricted variables for ‘sets’ 
in UR.* 

PROPOSITION 4.50 

UR is consistent if and only if NBG is consistent. 

Proof 

By Proposition 4.49, if UR is consistent, NBG is consistent. For the con- 
verse, note that any model of NBG yields a model of UR in which there are 
no urelements. In fact, if we replace ‘Cls(x)’ by the NBG formula ‘x = x’, 
then the axioms of UR become theorems of NBG. Hence, a proof of a 
contradiction in UR would produce a proof of a contradiction in NBG. 

The axiom of regularity (Reg) takes the following form in UR. 

Bn fact, a formula (Vx)$?(x) in NBG is an abbreviation in NBG for 
(VA r )((37)(Y G Y) =» &(X)). The latter formula is equivalent in UR to 
(Vx)(M(x) => ^(x)), which is abbreviated as (Vx)^(x) in UR. 
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(Reg UR ) iyX){X ^ 0 =*► (3u)(u e X A -i(3v)(v elAvG u))) 

It is clear that an analogue of Proposition 4.49 holds: UR + (Reg^) i s aj . 
extension of NBG + (Reg). Likewise, the argument of Proposition 4,$(i 
shows the equiconsistency of NBG + (Reg) and UR -f- (Reg,j R ). 

Since definition by transfinite induction (Proposition 4.14(b)) holds in 
UR, the cumulative hierarchy can be defined 

¥‘0=3 0 

¥‘(a') = 0»(¥‘a) 

Lim(2) => ¥‘2 3= (J ¥*/? 

/?<„A 

and the union H — U(¥“On) is the class of ‘pure’ sets in UR and forms a 
model of NBG + (Reg). In NBG, by Proposition 4.45, (Reg) is equivalent 
to V = //, where V is the class of all sets. 

If the class V m of urelements is a set, then we can define the following bv 
transfinite induction: 

E‘0 = V ut 
E‘(a') = ^(E‘a) 

Lim(A) =»S‘2 = S‘/? 

P<o* 

The union H ur — (J(E u On) is a model of UR + (Regim), and (Reg lR ) holds 
in UR if and only if H m is the class % of all elements. 

If the class V UI of urelements is a proper class, it is possible to obtain an 
analogue of H m in the following way. For any set x whose members are 
urelements and any ordinal y, we can define a function 5 E£ by transfinite 
induction up to y: 

E£‘0 — x 

s£ V) = 

Lim(A) => E£‘2 = [J S£‘/? 

P<JL 

Let be the class of all elements v such that, for some x and y, v is in the 
range of E£. Then determines a model of UR + (Reg lJR ), and, in UR, 
(Regjj R ) holds if and only if is the class of all elements. 

The equiconsistency of NBG and UR can be strengthened to show the 
following result. 


if a' < 0 y 
if A < 0 y 


PROPOSITION 4.51 

If NBG is consistent, then so is the theory UR -f (Reg ur ) + t V m is denu- 
merable’. 
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proof 

Within NBG one can define a model with domain to that is a model of NBG 
ivithout the axiom of infinity. The idea is due to Ackermann (1937). For any 
a nd tn in to, define m G n to mean that 2 m occurs as a term in the expansion 
0 f /; as a sum of different powers of 2d If we take ‘^4-sets’ to be members of to 
and ‘proper /I -classes’ to be infinite subsets of to, it is easy to verify all 
axioms of NBG + (Reg) except the axiom of infinity d (See Bemays (1954, 
pp. 81-82) for a sketch of the argument.) Then we change the '’membership’ 
relation on to by defining m n to mean that 2 m Gn. Now we define a ‘set’ to 
either 0 or a member n of to for which there is some m in to such that 
Hi £i n. We take the ‘urelements’ to be the members of to that are not ‘sets’. 
For example, 8 is an ‘urelement’, since 8 = 2 3 and 3 is not a power of 2. 
Other small ‘urelements’ are 1, 9, 32, 33 and 40. In general, the ‘urelements’ 
are sums of one or more distinct powers 2 k , where k is not a power of 2. The 
‘proper classes’ are to be the infinite subsets of to. Essentially the same 
argument as for Ackennann’s model shows that this yields a model Ji of all 
axioms of UR + (Reg lR ) except the axiom of infinity. Now we want to 
extend Ji to a model of UR. First, let r stand for the set of all finite subsets 
of to that are not members of to, and then define by transfinite induction the 
following function 0. 

©‘0 = co 

©‘(O') = £^(©‘a) — r 
Lim(2) =» ©‘A = (J &p 

Let Hb = U(©“0«). Note that Hb contains no members of r. Let us define a 
membership relation 6* on //b. For any members x and y of //#, define 
x £* y to mean that either x and y are in co and x G\ y, or y £ co and x Gy. 
The ‘urelements’ will be those members of to that are the ‘urelements’ of Ji. 
The ‘sets’ will be the ordinary sets of Hb that are not ‘urelements’, and the 
‘proper classes’ will be the proper classes of NBG that are subclasses of Hb- 
It now requires a long careful argument to show that we have a model of 
UR + (Reg UR ) in which the class of urelements is a denumerable set. 

A uniform method for constructing a model of UR + (Reg lJR ) in which 
the class of urelements is a set of arbitrary size may be found in Brunner 
(1990, p.65)d If AC holds in the underlying theory, it holds in the model as 
well. 


iThis is equivalent to the statement that the greatest integer k such that k ■ 2'" <73 
is odd. 

*For distinct natural numbers 73 j , . . . , n *, the role of the finite set {«| , . . . , 73*} is 
played by the natural number 2" 1 + . . . + 2" k . 

^Brunner attributes the idea behind the construction to J. Truss. 
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The most important application of axiomatic set theories with urelements 
used to be the construction of independence proofs. The first independence^ 
proof for the axiom of choice, given by Fraenkel (1922b), depended essen- 
tially on the existence of a denumerable set of urelements. More precise 
formulations and further developments may be found in Lindenbaum and' 
Mostowski (1938) and Mostowski (1939).^ Translations of these proofs into 
set theories without urelements were found by Shoenfield (1955), Mendelson 
(1956b) and Specker (1957), but only at the expense of weakening the axiom 
of regularity. This shortcoming was overcome by the forcing method of 
Cohen (1966), which applies to theories with (Reg) and without urelements 


tFor more information about these methods, see Levy (1965), Pincus (1972), 
Howard (1973) and Brunner (1990). 




5.1 ALGORITHMS. TURING MACHINES 

An algorithm is a computational method for solving each and every problem 
from a large class of problems. The computation has to be precisely specified 
so that it requires no ingenuity for its performance. The familiar technique 
for adding integers is an algorithm, as are the techniques for computing the 
other arithmetic operations of subtraction, multiplication and division. The 
truth table procedure to determine whether a statement form is a tautology 
is an algorithm within logic itself. 

It is often easy to see that a specified procedure yields a desired algorithm. 
In recent years, however, many classes of problems have been proved not to 
have an algorithmic solution. Examples are: 

1. Is a given wf of quantification theory logically valid? 

2. Is a given wf of formal number theory S true (in the standard interpre- 
tation)? 

3. Is a given wf of S provable in S? 

4. Does a given polynomial f{x \ , . . . ,x„) with integral coefficients have 
integral roots (Hilbert’s tenth problem)? 

In order to prove rigorously that there does not exist an algorithm for 
answering such questions, it is necessary to supply a precise definition of the 
notion of algorithm. 

Various proposals for such a definition were independently offered in 
1936 by Church (1936b), Turing (1936-37), and Post (1936). All of these 
definitions, as well as others proposed later, have been shown to be equiv- 
alent. Moreover, it is intuitively clear that every procedure given by these 
definitions is an algorithm. On the other hand, every known algorithm falls 
under these definitions. Our exposition will use Turing’s ideas. 

First of all, the objects with which an algorithm deals may be assumed to 
be the symbols of a finite alphabet A = {ao,ai, . . . ,a„}. Non-symbolic 
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Figure 5.1 


objects can be represented by symbols, and languages actually used for 
computation require only finitely many symbols.! 

A finite sequence of symbols of a language A is called a word of A. It is 
convenient to admit an empty word A consisting of no symbols at all. If p 
and Q are words, then PQ denotes the word obtained by writing Q to the* 
right of P. For any positive integer k , P A shall stand for the word made up of 
k consecutive occurrences of P. 

The work space of an algorithm often consists of a piece of paper or a 
blackboard. However, we shall make the simplifying assumption that all 
calculations take place on a tape that is divided into squares (see Figure 5.1), 
The tape is potentially infinite in both directions in the sense that, although 
at any moment it is finite, more squares always can be added to the right- 
and left-hand ends of the tape. Each square contains at most one symbol of 
the alphabet A. At any one time, only a finite number of squares contain 
symbols, while the rest are blank. The symbol ao will be reserved for the 
content of a blank square. ( In ordinary language, a space is sometimes used 
for the same purpose.) Thus, the condition of the tape at a given moment 
can be represented by a word of A; the tape in Figure 5.1 is a 2 a 0 a 5 ai. Our 
use of a one-dimensional tape does not limit the algorithms that can be 
handled; the information in a two-dimensional array can be encoded as a 
finite sequence.! 

Our computing device, which we shall refer to as a Turing machine , works 
in the following way. The machine operates at discrete moments of time, not 
continuously. It has a reading head which, at any moment, will be scanning 
one square of the tape. (Observation of a larger domain could be reduced to 
consecutive observations of individual squares.) The device then reacts in 
any of four different ways: 

1. It prints a symbol in the square, erasing the previous symbol. 

2. It moves to the next square to the right. 

3. It moves to the next square to the left. 

4. It stops. 


fff a language has a denumerable alphabet (ao, ai, . . .}, then we can replace it by 
the alphabet {b, *}. Each symbol a„ of the old alphabet can be replaced by the 
expression b* ■ • • *, consisting of b followed by n occurrences of *. 

-fThis follows from the fact that there is an effective one-one correspondence 
between the set of pairs of natural numbers and the set of natural numbers. For the 
details, see pp. 183-4. 
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What the machine does depends not only on the observed symbol but also 
ron the internal state of the machine at that moment (which, in turn, depends 
pn the previous steps of the computation and on the structure of the ma- 
chine). We shall make the plausible assumption that a machine has only a 
finite number of internal states {q 0 ,qi, . . . , q w }. The machine will always 
begin its operation in the initial state q 0 . 

A step in a computation corresponds to a quadruple of one of the fol- 
lowing three forms: (1) q y a,a^q,.; (2) q 7 a,-Rq # .; (3) q 7 a,-Lq,.. In each case, q, is 
The present internal state, a, is the symbol being observed, and q,. is the 
internal state after the step. In form (1), the machine erases a, and prints a*. 
In form (2), the reading head of the machine moves one square to the right, 
and, in form (3), it moves one square to the left. We shall indicate later how 
the machine is told to stop. 

Now we can give a precise definition. A Turing machine with an alphabet 
A of tape symbols {ao, ai , . . . , a n } and with internal states (q 0 , q i , • - • , q„, } is 
a finite set ST of quadruples of the forms (1) q^-a^q,., (2) q^a/Rq,., and (3) 
q^a/Lq,- such that no two quadruples of ST have the same first two symbols. 

Thus, for fixed q a,-, no two quadruples of types (1), (2) and (3) are in 
This condition ensures that there is never a situation in which the machine is 
instructed to perform two contradictory operations. 

The Turing machine ST operates in accordance with its list of quadruples. 
This can be made precise in the following manner. 

By a tape description of ST we mean a word such that: (1) all symbols in 
the word but one are tape symbols; (2) the only symbol that is not a tape 
symbol is an internal state q,; and (3) q, is not the last symbol of the word. 

A tape description describes the condition of the machine and the tape at 
a given moment. When read from left to right, the tape symbols in the 
description represent the symbols on the tape at that moment, and the tape 
symbol that occurs immediately to the right of q ; - in the tape description 
represents the symbol being scanned by the reading head at that moment. If 
the internal state q ; - is the initial state q 0 , then the tape description is called 
an initial tape description. 

Example 

The tape description a 2 aoq 1 a 0 aiai indicates that the machine is in the in- 
ternal state q l5 the tape is as shown in Figure 5.2, and the reading head is 
scanning the square indicated by the arrow. 

We say that ST moves one tape description a. into another one p (ab- 
breviated oc-»P) if and only if one of the following is true. 

1. a is of the form Pq y a,Q, p is of the form Pq r a*Q, and q^a/a^q,. is one of the 

quadruples of STd 

tHere and below, P and Q are arbitrary (possibly empty) words of the alphabet 
of 
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Figure 5.2 


2. a is of the form Pa^a/Q, (3 is of the form Pq,.a. T a/Q, and q •a / -Lq / . is one of 
the quadruples of dT . 

3. a is of the form q y a,Q, P is of the form q r aoa,-Q, and q ; a,Lq,. is one of the 
quadruples of dT . 

4. a is of the form Pq^-aA-Q, p is of the form Pa,q,.aAQ, and q^a/Rq,. is one of 
the quadruples of . 

5. a is of the form Pq 7 a,, p is of the form Pa,q r a 0 , and q 7 a,Rq ; . is one of the 
quadruples of & . 

According to our intuitive picture, ‘ fT moves a into p’ means that, if the 
condition at a time t of the Turing machine and tape is described by a, then 
the condition at time t -f- 1 is described by p. Notice that, by clause 3, 
whenever the machine reaches the left-hand end of the tape and is ordered to 
move left, a blank square is attached to the tape on the left; similarly, by 
clause 5, a blank square is added on the right when the machine reaches the 
right-hand end and has to move right. 

We say that stops at tape description a if and only if there is no tape 
description P such that a-»p. This happens when q 7 a, occurs in a but q.a,- is 
not the beginning of any quadruple of 

A computation of dT is a finite sequence of tape descriptions 
c*o, . . . , a* (k > 0) such that the following conditions hold. 

1 . o<o is an initial tape description, that is, the internal state occurring in a is q 0 . 

2. a ; — » a/ + i for O^i < k 

3. dT stops at a*. 

This computation is said to begin at «o and end at a a- If there is a compu- 
tation beginning at ao, we say that fT is applicable to oto. 

The algorithm Alg^r determined by dT is defined as follows: 

For any words P and Q of the alphabet A of 5", Alg^(P) = Q if and 
only if there is a computation of dT that begins with the tape de- 
scription q 0 P and ends with a tape description of the form R 1 q y R 2 , 
where Q = Ri R 2 . 

This means that, when dT begins at the left-hand end of P and there is 
nothing else on the tape, dT stops with Q as the entire content of the tape. 
Notice that Alg^r need not be defined for certain words P. An algorithm 
Alg^r determined by a Turing machine ,T is said to be a Turing algorithm. 
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Example 

In any computation of the Turing machine given by 

q 0 aoRqo, qo a i a oqi ; qo a 2 a oqi j • • ■ i qo a « a oqi 

0~ locates the first non-blank symbol (if any) at or to the right of the square 
scanned at the beginning of the computation, erases that symbol, and then 
stops. If there are only blank squares at or to the right of the initial square, 
keeps on moving right for ever. 

Let us now consider computations of number-theoretic functions. For 
convenience, we sometimes will write | instead of a! and B instead of ao. 
(Think of B as standing for ‘blank’.) For any natural number k, its tape 
representation k will stand for the word | A+1 , that is, the word consisting of 
Jc+ 1 occurrences of |. Thus, 0=|,1 = ||,2 = |||, and so on. The reason why 
we represent k by k + 1 occurrences of | instead of k occurrences is that we 
wish 0 to be a non-empty word, so that we will be aware of its presence. The 
tape representation (k\,k2 , . . . ,k n ) of an //-tuple of natural numbers 
{k\ , &2, . . . , k„) is defined to be the word /qB A^B-B/q,. For example, 
PTToTS) is III |B||B|B| f ill |. 

A Turing machine ST will be thought of as computing the following 
partial function fr,\ of one variable.* 

fr,\{k) = m if and only if the following condition holds: Alg ^(k) is 
defined and Alg $r(k) = Ei m E2, where Ei and E2 are certain (possibly 
empty) words consisting of only Bs (blanks). 

The function f$-,\ is said to be Turing-computable. Thus, a one-place partial 
function / is Turing-computable if and only if there is a Turing machine 
such that / = fy, 1 . 

For each n > 1 , a Turing machine ST also computes a partial function 
f f n of n variables. For any natural numbers k\, ... ,k„: 

f 3r.1t (ki, • • • , K) — m if and only if the following condition holds: 

Alg^r ( (£1 , #2 , . . . ,k„)) is defined and Alg^^Aq,^, . . . ,k„)) = Ei m E2, 
where Ei and E2 are certain (possibly empty) words consisting of only Bs 
(blanks). 

The partial function is said to be Turing-computable. Thus, an //-place 
partial function / is Turing-computable if and only if there is a Turing 
machine ST such that f — f ^, n . 

Notice that, at the end of a computation of a value of a Turing-com- 
putable function, only the value appears on the tape, aside from blank 
squares at either or both ends, and the location of the reading head does not 
matter. Also observe that, whenever the function is not defined, either the 

* Remember that a partial function may fail to be defined for some values of its 
argument. Thus, a total function is considered to be a special case of a partial 
function. 
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Turing machine will never stop or, if it does stop, the resulting tape is not of 
the appropriate form Ej m E 2 . 

Examples 

1. Consider the Turing machine 5", with alphabet {B, |}. defined by 
q 0 |Lq 1; qjB|q 2 . .T computes the suc cessor function N(x), since 
q 0 ^q, BAr-^q 2 /c+ 1, and ST stops at q 2 k + 1. Hence N(x) is Turing- 
computable. 

2. The Turing machine defined by 

qol B qi>qi BR qo>qo B lq2 

computes the zero function Z{x). Given k on the tape, moves right, 
erasing all |s until it reaches a blank, which it changes to a|. So, 0 is the 
final result. Thus, Z(x) is Turing-computable. 

3. The addition function is computed by the Turing machine defined by 
the following seven quadruples: 

qolBqo. qoBRqi , qi l R qi> qi B lq 2 , q 2 l R q 2 , q2 BL lq3> q3l fi q3 

In fact, for any natural numbers m and n, 

0.M = q 0 l"' +, B|"+ , tf q„BrB|" + UBq 1 |”'Br +l 

l/ ^ 

-» - • - -^Brq,Br ,+1 - > >Brq 2 |r +1 ^ • - - 

r f J- & 

-*>B |™ |' ,+2 q 2 B-^-B | m+ ” + 1 q 3 1 B-^»B |" ,+,,+1 qj BB = B^T+7iq 3 BB 
and IT stops at B w -Fi7q 3 BB. 

> il 

Exercises 

5.1 Show that the function U 2 such that U 2 (x\,x 2 ) = x 2 is Turing-com- 
putable. 

5.2 (a) What function /(*i,* 2 ,* 3 ) is computed by the following Turing 
machine? 

qollqi»qil B qoi qo BR qi . qi br Q2 , 

q2l R q2, q2 BR q3i q3l B q 4 > q 4 BR qs 

(b) What function / (x) is computed by the following Turing machine? 

qol B qi.qi BR q2>q2 B (q2 

5.3 (a) State in plain language the operation of the Turing machine, de- 
scribed in Example 3, for computing the addition function. 

(b) Starting with the tape description q 0 1 1 1 B 1 1 1 1 , write the sequence of 
tape descriptions that make up the computation by the addition machine of 
Example 3. 

5.4 What function f{x) is computed by the following Turing machine? 
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qolRq, q 4 |Rq4 q 6 B|q 0 

qi|Bq 2 q 4 B|q 5 q^lq, 
q 2 BRq 3 q 5 |Lq 5 q 7 |Lq 7 

q 3 |Rq 3 q 5 BL q6 q?BRq 8 

q 3 BRq 4 q 6 |Lq 6 q g |Bq g 

:5.5 Find a Turing machine that computes the function sg(x). (Recall that 
, s g(0) = 0 and sg(x) = 1 for x > 0.) 

5.6° Find Turing machines that compute the following functions. 

Ta) x—y (Remember that x—y — x — y if x > y, and x—y = 0 if x < >>.) 

(b) [x/2] (Recall that [x/2] is the greatest integer less than or equal to x/2. 
Thus, [x/2] = x/2 if x is even, and [x/2] = (x — l)/2 if x is odd.) 

(c) x • y, the product of x and y. 

5.7 If a function is Turing-computable, show that it is computable by 
infinitely many different Turing machines. 


5.2 DIAGRAMS 

Many Turing machines that compute even relatively simple functions (like 
multiplication) require a large number of quadruples. It is difficult and 
tedious to construct such machines, and even more difficult to check that 
they do the desired job. We shall introduce a pictorial technique for con- 
structing Turing machines so that their operation is easier to comprehend. 
The basic ideas and notation are due to Hermes (1965). 

1. Let . . . , be any Turing machines with alphabet A = 
{a 0 , aj , . . . , a*}. 

2. Select a finite set of points in a plane. These points will be called vertices. 

3. To each vertex attach the name of one of the machines ST \ , . . . , &~ r . 
Copies of the same machine may be assigned to more than one vertex. 

4. Connect some vertices to others by arrows. An arrow may go from a 
vertex to itself. Each arrow is labelled with one of the numbers 0, 1, . . . , k. 
No two arrows that emanate from the same vertex are allowed to have 
the same label. 

5. One vertex is enclosed in a circle and is called the initial vertex. 

The resulting graph is called a diagram. 

Example 
See Figure 5.3. 

We shall show that every diagram determines a Turing machine whose 
operation can be described in the following manner. Given a tape and a 
specific square on the tape, the Turing machine of the initial vertex V of the 
diagram begins to operate, with its reading head scanning the specified 
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Figure 5.3 



square of the tape. If this machine finally stops and the square being scanned 
at the end of the computation contains the symbol a,, then we look for an 
arrow with label i emanating from the vertex V. If there is no such arrow 
the computation stops. If there is such an arrow, it leads to a vertex to which 
another Turing machine has been assigned. Start that machine on the tape 
produced by the previous computation, at the square that was being scan- 
ned at the end of the computation. Repeat the same procedure that was just 
performed, and keep on doing this until the machine stops. The resulting 
tape is the output of the machine determined by the diagram. If the machine 
never stops, then it is not applicable to the initial tape description. 

The quadruples for this Turing machine can be specified in the following 
way. 

1 . For each occurrence in the diagram of a machine ^ write its quadruples, 
changing internal states so that no two machine occurrences have an 
internal state in common. The initial vertex machine is not to be changed. 
This retains q 0 as the initial internal state of the machine assigned to the 
initial vertex. For every other machine occurrence, the original initial 
state q 0 has been changed to a new internal state. 

2. If an occurrence of some ^Hs connected by an arrow — * to some then, 

for every (new) internal state q^ of that occurrence of such that no 
(new) quadruple of begins with q^a,,, add the quadruple q^a^q,, 
where q, is the (new) initial state for Jy (Step 2 ensures that, whenever % 
stops while scanning a w , ^ will begin operating.) 

The following abbreviations are used in diagrams: 

0 1 k 

1. If one vertex is connected to another vertex by all arrows 
we replace the arrows by one unlabelled arrow. 

2. If one vertex is connected to another by all arrows except — we replace 
all the arrows by 5. 

3. Let \ 2 stand for ST \ —> OF i, let ff \ST i!T 3 stand for ST \ — ► 2 — ► ^3, 

and so on. Let ST 1 2 3 4 be ST ST, let !T Z be ST !T , and so forth. 

4. If no vertex is circled, then the leftmost vertex is to be initial. 

To construct diagrams, we need a few simple Turing machines as building 
blocks. 
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1. r (right machine). Let {ao,ai, . . . , a*} be the alphabet, r consists of the 
quadruples qoa/Rqj for all a,. This machine, which has k + 1 quadruples, 
moves one square to the right and then stops. 

2. 1 (left machine). Let {ao, aj, . . . , a*} be the alphabet. 1 consists of the 
quadruples q 0 a,Lqj for all a,-. This machine, which has k+ 1 quadruples, 
moves one square to the left and then stops. 

3. a j (constant machine) for the alphabet {ao, aj, . . . , a*}, aj consists of the 
quadruples q^-a^ for all a,-. This machine replaces the initial scanned 
symbol by a, and then stops. In particular, ao erases the scanned symbol, 
and a\ prints |. 

Examples of Machines Defined by Diagrams 

1. P (Figure 5.4) finds the first blank to the right of the initially scanned 
square. In an alphabet {ao, aj , . . . , a*}, the quadruples for the machine P 
are: q 0 a,Rqj for all a,, and q^-a,^ for all a,- ao- 


4 = 0 


— *► r 

Figure 5.4 


2. A (Figure 5.5) finds the first blank to the left of the initially scanned 
square. 


=4 0 


Figure 5.5 


Exercises 

5.8 Describe the operations of the Turing machines p (Figure 5.6) and k 
(Figure 5.7) and write the list of quadruples for each machine. 


o 


** r 


Figure 5.6 
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o 


Figure 5.7 

5.9 Show that machine S in Figure 5.8 searches the tape for a non-blank 
square. If there are such squares, S finds one and stops. Otherwise, S never 
stops. 


o 


r 



0 

&i pflof — ^ a | X. 9 q 


? 0 


I* ° 


p a 0 X X a© p 


Figure 5.8 


To describe some aspects of the operation of a Turing machine on part of 
a tape, we introduce the following notation: 



arbitrary symbol 

B . . . B 

sequence of blanks 

B . . . 

everything blank to the right 

. . . B 

everything blank to the left 

W 

non-empty word consisting of non-blanks 

X 

W 1 BW 2 B . . . W „(n > 1), a sequence of 
nonempty words of non-blanks, separated 
by blanks 


Underlining will indicate the scanned symbol. 


More Examples of Turing Machines Defined by Diagrams 
3. 8ft (right-end machine). See Figure 5.9. 

ffXBB => ^ X B B 

Squares on the rest of the tape are not affected. The same assumption is made 
in similar places below. When the machine 01 begins on a square preceding a 
sequence of one or more nonempty words, followed by at least two blank 
squares, it moves right to the first of those blank squares and stops. 

* o 


Pr i 

0 


Figure 5.9 
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4 . <£ (left-end machine) See Figure 5.10. 

BBX~ => BBX 




4 = 0 



0 

Figure 5.10 

5 . T (left-translation machine) See Figure 5.11.1 

~BWB => ~ WBB 

This machine shifts the whole word W one square to the left. 



Figure 5.11 

6 . a (shift machine). See Figure 5.12. 

B W i BW 2 B => BW 2 B . . . B 

In the indicated situation, Wi is erased and W 2 is shifted leftward so that it 
begins where W] originally began. 



&qT 



T 

Figure 5.12 


7. C (clean-up machine) See Figure 5.13. 

~ BBXBWB => -WB...B 

1 There is a separate arrow from r 1 2 to each of the groups on the right and a 

separate arrow from each of these, except lao, back to r 2 . 





T AIT 
Figure 5.13 


8. K (word-copier) See Figure 5.14. 

BWB. . . BWBWB . . . 



Figure 5.14 


9. K„ (//-shift copier) See Figure 5.15. 


BW„BW„_(B . . . WjB . . . =» BW„ BW„_ i B . . . W,BW„B . . . 



Figure 5.15 


Exercises 

5.10. Find the number-theoretic function f(x) computed by each of the 
following Turing machines. 

(a) lai 

(b) Figure 5.16 

(c) PKAa l A(ra 0 ) 2 

5.11. Verify that the given functions are computed by the indicated Turing 
machines. 
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Figure 5.16 


(a) \x-y\ (Figure 5.17) 



Figure 5.17 


(b) x + y Pai A(rao ) 2 

(c) x ■ y (Figure 5.18) 



PrajrC r — 1 aj A (r a 0 ) 2 P 



Figure 5.18 

5.12. Draw diagrams for Turing machines that will compute the following 
functions: (a) max(x, j) (b) min(x,y) (c) x—y (d) [x/2] 

5.13. Prove that, for any Turing machine with alphabet (a 0 , . . . , a*-}, there 
is a diagram using the Turing machines r, l, ao, . • . , that defines a Turing 
machine such that ST and Sf have the same effect on all tapes. (In fact, ff 
can be defined so that, except for two additional trivial initial moves left and 
right, it carries out the same computations as ,T.) 

5.3 PARTIAL RECURSIVE FUNCTIONS. 

UNSOLVABLE PROBLEMS 


Recall, from Section 3.3, that the recursive functions are obtained from the 
initial functions (the zero function Z(x), the successor function A (x), and the 
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projection functions Uf{x 1 , . . . ,x„)) by means of substitution, recursion and 
the restricted /(-operator. Instead of the restricted /(-operator, let us intro- 
duce the unrestricted p- operator : 

If /(*i , • • • i r h ) f.iy{cj{x "i, . . . i x r , f j j 0) 

= the least y such that g(x t . ,x„,y) = 0 
then / is said to arise from g by means of the unrestricted /(-operator. 

Notice that, for some the value f(x i, . . . ,x„) need not be defined 4 

this happens when there is no y such that g(x v , . . . ,x„,y) =0. 

If we replace the restricted /(-operator by the unrestricted /(-operator in 
the definition of the recursive functions, we obtain a definition of the partial 
recursive functions. In other words, the partial recursive functions are those 
functions obtained from the initial functions by means of substitution, re- 
cursion and the unrestricted /(-operator. 

Whereas all recursive functions are total functions, some partial recursive 
functions will not be total functions. For example, gy(x+y = 0) is defined 
only when x = 0. 

Since partial recursive functions may not be defined for certain arguments, 
the definition of the unrestricted /(-operator should be made more precise: 

Hy(g(x i, . . . , x„ , y) = 0) = k means that, for < k, 
g(x i , . . . ,x„,u) is defined and //(xi ,x n ,u) f 0, and 
g (x i , • * . , Xfi , y'j ' 0 . 

Observe that, if R(xi , . . . is a recursive relation, then 

py(R(x i, . . . , x„,t)) can be considered an admissible application of the un- 
restricted /(-operator. In fact, f.iy(R(x \, . . . = //>’(Q(xi , . . . 

= 0), where Cr is the characteristic function of R. Since R is a recursive 
relation, Cr is, by definition, a recursive function. 

Exercises 

5.14 Describe the following partial recursive functions. 

(a) py(x + y + 1 = 0) 

(b) f.iy(y > x) 

(c) gy{y + x=x) 

5.15 Show that all recursive functions are partial recursive. 

5.16 Show that every partial function whose domain is a finite set of natural 
numbers is a partial recursive function. 

It is easy to convince ourselves that every partial recursive function 
f(x is computable, in the sense that there is an algorithm that 
computes f(x i , . . . , x n ) when f(x i, . . . ,x„) is defined and gives no result when 
f(x i, . . . , x „ ) is undefined. This property is clear for the initial functions and 
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is inherited under the operations of substitution, recursion and the unre- 
stricted /i-operator. 

It turns out that the partial recursive functions are identical with the 
Turing-computable functions. To show this, it is convenient to introduce a 
different kind of Turing-computablility. 

A partial number-theoretic function f(x is said to be standard 
Turing-computable if there is a Turing machine such that, for any natural 
numbers ku . . . , k„, the following holds. 

Let B&iBA^B . . . B k n be called the argument strip) Notice that the ar- 
gument strip is B (£ 1 , . . . Take any tape containing the argument 
strip but without any symbols to the right of it. (It may contain 
symbols to the left.) The machine is begun on this tape with its 
reading head scanning the first | of k\. Then: 

1. stops if and only if f(k\,... ,k„) is defined. 

2. If stops, the t ape contains the same argument strip as before, followed 
by B/(/q , . . . k„). Thus, the final tape contains 

Bk\ BA 2 B . . . BA„B/(^i , • - - , k„) 

Moreover: 

3. The reading head is scanning the first | of f(k\, . . . ,k„). 

4. There is no non-blank symbol on the tape to the right of f(k\,... ,k„ ). 

5. During the entire computation, the reading head never scans any square 
to the left of the argument strip. 

For the sake of brevity, we shall say that the machine described above 
ST-computes the function f(x\ , . . . ,x H ). 

Thus, the additional requirement of standard Turing computability is 
that the original arguments are preserved, the machine stops if and only if 
the function is defined for the given arguments, and the machine operates on 
or to the right of the argument strip. In particular, anything to the left of the 
argument strip remains unchanged. 


PROPOSITION 5.1 

Every standard Turing-computable function is Turing-computable. 

Proof 

Let J be a Turing machine that ST-computes a partial function 
f{x \, . . . ,x„). Then / is Turing-computable by the Turing machine «^~PC. In 

tFor a function of one variable, the argument strip is taken to be BA 1 . 
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fact, after ST operates, w e obtain BTiB . . . Bx„B/(xi , . . . , jc„ ) , with the reading 
head at the leftmost | of fix i, P then moves the reading head to the 

right of f(x i, . . . ,*b), and then C removes the original argument strip. 


PROPOSITION 5.2 

Every partial recursive function is standard Turing-computable. 

Proof 

(a) Prai ST-computes the zero function Z(x). 

(b) The successor function Nix) is ST-computed by PKajAr. 

(c) The projection function Uj l (x is ST-computed by 
&K„_ i+i Ar. 

(d) ( Substitution .) Let f(x i , . . . ,x„) = g(/n(xi, . . . , h m {x x , . . . 

and assume that ST-computes g and ^ ST-computes hj for 1 < j^im 

Let j be the machine (K„ +y ) " A ,! r . The reader should verify that 
/ is ST-computed by 

^',P(K n+ i) n A"r^2^3 - • • N, n - l N m Pa n A m r,T(T"'Ar 

We take adv anta ge of the ST -computability when, storing jci, . . . , * 
hi(xi, . . . ,x„), . . .,hj(xi,. . , , x„) on the tape, we place (x \, . . . ,x„) on the 
tape to the right and compute hi+i (xi, . . . ,jc„) without disturbing what 
we have stored on the left. 

(e) ( Recursion .) Let 

/(*1, ‘ ' ‘ >^®l 

/(^i 7 ■ ■ • j y "h 1 ) — ^C^'i 5 • • ■ t Xfi) f(xy , . . . , X /) ) y)) 

Assume that Sf ST-computes g find ST ST-computes h. Then the reader 
should verify that the machine in Figure 5.19 ST-computes /. 

(f) Unrestricted ^-operator . Let f{x i, . . .x„) = py(g(x i, . . . , x„,y) — 0) and 
assume that ST-computes g. Then the machine in Figure 5.20 ST- 
computes /. 



i+l 



Figure 5.19 
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1 

£3?raj A n+1 r^”r *• Prajrolaol 

0 

" 

la 0 !Ar 

Figure 5.20 


Exercise 

S.17 For a recursion of the form 

m = k 

f(y+l) = h(yj(y)) 

show how the diagram in Figure 5.19 must be modified. 


COROLLARY 5.3 

Every partial recursive function is Turing-computable. 

Exercise 

5.18 Prove that every partial recursive function is Turing-computable by a 
Turing machine with alphabet {a 0 ,ai}. 

In order to prove the converse of Corollary 5.3, we must arithmetize the 
language of Turing computability by assigning numbers, called Godel 
numbers , to the expressions arising in our study of Turing machines. ‘R’ and 
‘L’ are assigned the Godel numbers 3 and 5, respectively. The tape symbols 
a / are assigned the numbers 7 + 4/, while the internal state symbols q, are 
given the numbers 9 + 4 i. For example, the blank B, which is ao, receives the 
number 7; the stroke |, which is ai, has the number 11; and the initial 
internal state symbol q 0 has the number 9. Notice that all symbols have odd 
Godel numbers, and different symbols have different numbers assigned to 
them. 

As in Section 3.4, a finite sequence uo,ui, . . . , u* of symbols is assigned 
the Godel number p^ v °'p^ u '\ . . p e k ^ Uk \ where po, p\ , pj,--. are the prime 
numbers 2, 3, 5, . . . in ascending order and g(u,) is the Godel number as- 
signed to U;. For example, the quadruple q 0 aoaiq 0 receives the Godel 
number 2 9 3 7 5 n 7 9 . 

By an expression we mean a finite sequence of symbols. We have just 
shown how to assign Godel numbers to expressions. In a similar manner, to 
any finite sequence Eo,E}, . . . ,E,„ of expressions we assign the number 
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pf^pf^ . . .pm hn ‘\ For example, this assigns Godel numbers to finite se- 
quences of Turing machine quadruples and to finite sequences of tape de- 
scriptions. Observe that the Godel number of an expression is even and 
therefore, different from the Godel number of a symbol, which is odd. 
Moreover, the Godel number of a sequence of expressions has an even 
number as an exponent of po and is, therefore, different from the Godel 
number of an expression, which has an odd number as an exponent of » 0 
The reader should review Sections 3.3 and 3.4, especially the functions 
//£(x), (x) y ), and x*y. Assume that x is the Godel number of a finite se- 
quence w 0 , wi, . . . , w k ; that is, * = p^p^ . . .pf w,: \ where g(wj) is the 
Godel number of w y. Recall that £&{x) — k + 1, the length of the sequence 
and (x)j = g(wy), the Godel number of the yth term of the sequence. If in 
addition, y is the Godel number of a finite sequence Vo, Vj , . . . , v w , then x* y 
is the Godel number of the juxtaposition Wo, Wi , . . . , w k , vo, Vi , . . . , v,„ of the 
two sequences. 


PROPOSITION 5.4 

The following number-theoretic relations and functions are primitive re- 
cursive. In each case, we write first the notation for the relation or function, 
then, the intuitive interpretation in terms of Turing machines, and, finally, 
the exact definition. (For the proofs of primitive recursiveness, use Propo- 
sition 3.18 and various primitive relations and functions defined in Section 
3.3. At a first reading, it may be advisable to concentrate on just the intuitive 
meanings and postpone the technical verification until later.) 

IS(x): x is the Godel number of an internal state symbol q i( : 

= 9 + Au) 

Sym(x): jc is the Godel number of an alphabet symbol a u : 

( 3ll )u<x( X = 7 + 4w ) 

Quad(x): x is the Godel number of a Turing machine quadruple: 

U(x) = 4 A IS((*) 0 ) A Sym((x) l ) A IS((*) 3 ) 

A [Sym((*) 2 ) V (xj 2 = 3 V (xj 2 = 5] 

TM(x): x is the Godel number of a Turing machine (in the form of a finite 
sequence of appropriate quadruples): 

Q uac K (*)„ ) A A- > 1 A (Vk) 

u <£/*(x) (Vr) 

v<£/i(x) ^ 

=» [((*).)» + ((*).)» v ((*).), t (W„),] 

TD(x): x is the Godel number of a tape description: 

* > 1 A (Vi/) I(< ^)[IS(W I( ) V Sym((x) t< )] a (3im) ii < m(x) i S((x) u ) 

a =► ii + 1 < m(x)) 
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Cons(.x,p,z): x and y are Godel numbers of tape descriptions a and p, and 
z is the Godel number of a Turing machine quadruple that transforms a 
into P: 

TD(*) A TD(y) A Quad(z) A (3w) M< , /( , K1 [IS((>;) w ) 

A (- x ')„, = (z)o A (*)«-+ 1 = ( z ) i A 
j f ([Sym((z) 2 ) A C v) w+ i - (z) 2 A G0 W = (z) 3 A «(*) = U(y) 

\ HV“)u<em( u / rv Au/w+\=> (x) u = (y).)]V 

r [( z ) 2 = 3 A (y)w = (*)>*+ 1 A (>’)„■+ 1 = ( z ) 3 a 
I (Vw)„<£/?(.r)(w ^WAI/^W+U (y)„ = (*) U )A 

| ([w + 2 < &J(*) a M{y) = U(x)] V [w + 2 = £4(x)A 

< M(y) = M(x) + l a (y) w+2 = 7])]v 

' [(z) 2 = 5 /\ {[w i-- 0 A (y) w ^ { = (z) 3 A (y) w = (x) wJ ., 

A M(y) = M(x) A (V«) u<fli(jc) (u A u 

(y)u = Wj] v t w = 0 A (y)o = (z) 3 A (y), = 7 a 
k ^0) = M(x) + 1 A (Vi/) 0 <„<^(.v)0 ; ) I( +i = (*)«]}])] 

I corresponds to a quadruple q y a,a^q r ,II to a quadruple q / a,Rq r , and III 
to a quadruple q.-a,Lq,.. 

NTD (x) : x is the Godel number of a numerical tape description — that is, 
a tape description in which the tape has the form E1AEI2, where each of Ej 
and E 2 is empty or consists entirely of blanks, and the location of the 
reading head is arbitrary: 

TD(x) A (Vw)„ <£4t) (Sym((x) M ) =► (*)„ = 7 V (x) tl = 11) 

A (V ll )u<£h( X ) (y v )v<m*) ( yw \v<mx) ( ll < VA »< w A (*)«, = 11 A 

(x) w = 11 =► (x) v ± 7) (3u) u<eh{x) ((x) tl = 11) 

Stop(x,z): z is the Godel number of a Turing machine and x is the 
Godel number of a tape description a such that stops at a: 

TM(z) A TD(x) A -(3 W ), (<W (,)[IS((x)J A (3r)^ /(2) (((z)„) 0 

= (*)« A (( Z )Jl = (*),/+ 1 )] 

Comp(y,z): z is the Godel number of a Turing machine ST and y is the 
Godel number of a computation of 2 T\ 

}’> 1 A TM(z) A (V«)„ < ^ w TD((y)J A Stop ((y)tt(y)-i, z ) A 
(V«) u <£/(>’) - 1 ( 3 ^) vv <^( 2 ) Co ns ( (y ) I( , (y)«+i> (z)„,) A 

(Vt7) „ <£y?(( >; ’) 0 ) (IS ( ( ) 0 )d ) ^ (Wo)!) = 

Num(r): The Godel number of the word x — that is, of |*. 

Num(x) = JJpJ 1 

U^X 
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TR(xi, . . . ,x„): The Godel number of the tape representation (xjVTTT'vJ' 
of the /7-tuple (xj, . . . ,x„): 

TR(xi , . . . ,x„) = Num(xi ) * 2 7 * Num(x 2 ) * 2 7 * • • • * 2 7 * Num(x„) 

V (;') : If y is the Godel number of a computation that results in a nu- 
merical tape description, then JJ{y>) is the number represented on that 
final tape.! 

m = \ £ ^d(0'Wi).-ni)]-i 

[Let w be the number, represented by | u , on the final tape. The cal- 
cualtion of U{y) tallies a 1 for every stroke | that appears on the final 
tape. This yields a sum of w + 1, and then 1 is subtracted to obtain w.] 
T„(z,x j, . . . ,x„,y): y is the Godel number of a computation of a Turing 
machine with Godel number z such that the computation begins on the 
tape (xj, . . . ,x„) , with the reading head scanning the first | in xj , and ends 
with a numerical tape description: 

Comp(y,z) A (y) 0 =2% TR(*i , . . . ,x„) a NTD((y)^ (y) ^ 1 ) 

When n=l, replace TR(xi, . . . ,x„) by Num(xi). (Observe that, if 
T„(z,x i, . . . and T„(z,x h . . . ,x„,^), then y\ =y 2 , since there is at 

most one computation of a Turing machine starting with a given initial 
tape.) 

PROPOSITION 5.5 

If ST is a Turing machine that computes a number-theoretic function 
/(x 1? . . . ,x„) and e is a Godel number of ST , then! 

f(x i, . . . ,x„) = U(nyT„(e,x u . . . ,x,„v)) 

Proof 

Let ki , . . . ,k n be any natural numbers. Then k„) is defined if and 

only if there is a computation of £T beginning with (k\, . . . ,k„) and ending 
with a numerical tape description - that is, if and only if 
(3y)T„{e, k\, . . . >x n ,y). So, /(£[, . . . ,k„) is defined if and only if 
f.iyT„(e,ki, . . . ,k n ,y) is defined. Moreover, when f(k\, . . . , k n ) is defined, 

Uf y is not the Godel number of a computation that yields a numerical tape 
description, U (y) is defined, but its value in such cases will be of no significance. 

1 Remember that an equality between two partial functions means that, when- 
ever one of them is defined, the other is also defined and the two functions have the 
same value. 
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uyT„( e,k i, >>) is the Godel number of a computation of S' beginning 

with and U (f.iyT„(e , k \ , . . . , k fn y)) is the value yielded by the 

computation, namely, f(k\, . . . ,k„). 


COROLLARY 5.6 

Every Turing-computable function is partial recursive. 

proof 

Assume /(vi, . . . ,x„) is Turing-computable by a Turing machine with Godel 
number e. Then f(x\ , . . . ,x„) = U([.iyT„(e,x \, . . . ,x„,y)). Since T„ is primitive 
recursive, fiyT„(e,xi, . . . ,x n ,y) is partial recursive. Hence, U(^iyT„(e 7 x i, 
. . . , X,, , y) ) is partial recursive. 


COROLLARY 5.7 

The Turing-computable functions are identical with the partial recursive 
functions. 


Proof 

Use Corollaries 5.6 and 5.3. 


COROLLARY 5.8 

Every total partial recursive function is recursive. 


Proof 

Assume that the total partial recursive function f(x is Turing- 
computable by the Turing machine with Godel number e. Then, for all 
x u ...,x„, (3y)T„(e,x u . . . ,x„,y). Hence, iiyT n (e,x i, . . . ,x n ,y) is produced by 
an application of the restricted //-operator and is, therefore, recursive. So, 
U(/iyT n (e,x\ , . . . , x„ , y ) ) is also recursive. Now use Proposition 5.5. 

COROLLARY 5.9 

For any total number-theoretic function /, / is recursive if and only if / is 
Turing-computable. 
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Proof 

Use Corollaries 5.7-5. 8 and Exercise 5.15. 

Church’s thesis amounts to the assertion that the recursive functions are 
the same as the computable total functions. By Corollary 5.9, this k 
equivalent to the identity, for total functions, of computability and Turing 
computability. This strengthens the case for Church’s thesis because of the 
plausibility of the identification of Turing computability with computabil- 
ity. Let us now widen Church’s thesis to assert that the computable func- 
tions (partial or total) are the same as the Turing-computable functions. By 
Corollary 5.7, this implies that a function is computable if and only if it i s 
partial recursive. 


COROLLARY 5.10 

Any number-theoretic function is Turing-computable if and only if it is 
standard Turing-computable. 


Proof 

Use Proposition 5.1, Corollary 5.6 and Proposition 5.2. 


COROLLARY 5.11 (KLEENE’S NORMAL FORM THEOREM) 

As z varies over all natural numbers, U(piyT n (z,x\, , x„ , y) ) enumerates 
with repetitions all partial recursive functions of n variables. 


Proof 

Use Corollary 5.3 and Proposition 5.5. The fact that every partial recursive 
function of n variables reappears for infinitely many z follows from Exercise 
5.7. (Notice that, when z is not the Godel number of a Turing machine, there 
is no y such that T„(z,x i, . . . ,x„,y), and, therefore, the corresponding partial 
recursive function is the empty function.!) 


COROLLARY 5.12 

For any recursive relation 7?(xi , . . . , x n , y ) , there exist natural numbers z 0 and 
vq such that, for all natural numbers jq, . . . ,x„: 


tThe empty function is the empty set 0. It has the empty set as its domain. 
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« 

(a) (3y)R(xi, . • - ,x„,y) ^ and only if {3y)T„{zn,x \, . . . ,x n ,y) 

(b) (Vy)i?(xi , . . . ,xn,y) if and only if (\/y)^T n (v 0 ,x u . . . , x n , y) 

Proof 

(a) The function f(x \ , . . . ,x„) = f.iyR(x i , . . . ,x„,y) is partial recursive. Let zq 
be a Godel number of a Turing machine that computes f. Hence, 

/(x is defined if and only if (3y)T n (z 0 ,x\,. . . ,x„,y). But 

/(x i, . . . ,x„) is defined if and only if (3y)R(x \, . . . ,x„,y). 

(b) Applying part (a) to the recursive relation ->R(x j , . . . , x„, y), we obtain a 
number such that: 

. . . ,x,„>>) if and only if (3y)3;(no,xi, . . . ,x„,y) 

Now take the negations of both sides of this equivalence. 

Exercise 

5.19 Extend Corollary 5.12 to two or more quantifiers. For example, if 
R(x ,y,z) is a recursive relation, show that there are natural numbers 

zo and ro such that, for all xi, . . . ,x„: 

(a) (Vz)(3y)i?(xi, . . . ,x ln y,z) if and only if (Vz)(3y)r„ + i(z 0 ,xi, . . . ,x„,y,z). 

(b) (3z)(Vy)R(x i,. . . ,x„,y,z ) if and only if • • • ,x„,y,z ). 

COROLLARY 5.13 

(a) (3>')7),(xi , a'i , a' 2 , . . . ,x„,y) is not recursive. 

(b) (3y)T„(z,x i , ... ,x„,y) is not recursive. 

Proof 

(a) Assume (3y)T„(xi,xi,x2, ■ ■ . ,x fh y) is recursive. Then the relation 
-i(3y)T„(xi,xi,X2, . . . ,x„,y) Az = z is recursive. So, by Corollary 5.12(a), 
there exists zo such that: 

(3z)(-<(3y)r„(xi,X[,X2, . . . ,x„,y) Az — z) if and only if 

(3z)7ij(zo, X| , * • • i X n J z) 

Hence, since z obviously can be omitted on the left, 

~ '(3y)-//( ( a*[ , , a* 2 , . • . ,x„,y) if and only if f 3z) L (zo , xi , a* 2 , . . . ,Xh,z) 

Let xj = X 2 = ■ • • = x n = zq. Then we obtain the contradiction 

-.(3>>)iL(zo,zo,zo, • • • ,zo,y) if and only if (3z)7i i (z 0 ,zo,zo, • • • ,z 0 ,z) 

(b) If (3y) T n (z, xj , X 2 , ...,x»,y) were recursive, so would be, by substitu- 
tion, (3y)r„(xi,xi,X2, . . . ,x„,y), contradicting part (a). 



COMPUTABILITY 


Exercises 


5.20 Prove that there is a partial recursive function g(z> x) such that, for 
any partial recursive function f(x), there is a number z 0 for which 
f(x) = g(zo,x) holds for all x. Then show that there must exist a number t> 0 
such that g(v o, Vo) is not defined. 

5.21 Let /i)(xt, . . . ,x„), . . . ,hk(x i, . . . ,x K ) be partial recursive functions, and 
let Ri (xj , . . . , x„), . . . , Rk(x i , . . • ,x„) be recursive relations that are exhaustive 
(i.e., for any Xj,...,x„, at least one of the relations holds) and pairwise 
mutually exclusive (i.e., for any xj,...,x„, no two of the relations hold). 
Define 



h i (x* i j . . * , Xfi ) if R [ (a Xjj ) 

hk (a A/i ) if Rk (a | , . • * , A// ) 


Prove that g is partial recursive. 

5.22 A partial function /(x) is said to be recursively completable if there is a 

recursive function h(x) such that, for every x in the domain of /, h(x) = f(x). 

(a) Prove that pyTi(x,x,y) is not recursively completable. 

(b) Prove that a partial recursive function /(a) is recursively completable if 
the domain D of / is a recursive set - that is, if the property ‘x € & is 
recursive. 

(c) Find a partial recursive function / (x) that is recursively completable but 
whose domain is not recursive. 


5.23 If R(x,y) is a recursive relation, prove that there are natural numbers 
zo and vo such that: 

(a) -.[(3 j)/?(z 0 ,t) ^ (V^)-i7i(zo,z 0 ,t)] 

(b) — 1 [ ( V>’) A? ( Uo > >') <=> {^y)T\ (t'o, r 0 ,y)] 

5.24 If S(x) is a recursive property, show that there are natural numbers z 0 
and vo such that: 

(a) -.[^(zo) (Vy)--7i (z 0 , z 0 , y)] 

(b) -^[S(v 0 ) <4> (3y) 71 (v 0 , r 0 ,>')] 

5.25 Show that there is no recursive function B(z,x \, . . . ,x„) such that, if z 
is a Godel number of a Turing number 2T and k \ , . . . , k n are natural numbers 
for which / (ki , . . . , k„) is defined, then the number of steps in the com- 
putation of f&-jt(k\, . . . , k„) is less than B{z,k\ , . . . , k„). 

Let ST be a Turing machine. The halting problem for .T is the problem of 
determining, for each tape description p, whether 2T is applicable to p, that 
is, whether there is a computation of S' that begins with p. 

We say that the halting problem for S' is algorithmically solvable if there 
is an algorithm that, given a tape description p, determines whether S' is 
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applicable to p. Instead of a tape description p, we may assume that the 
algorithm is given the Godel number of p. Then the desired algorithm will 
be a computable function Hg- such that: 

{ 0 if x is the Godel number of a tape description P 
to which S' is applicable 
l otherwise 

If we accept Turing algorithms as exact counterparts of algorithms (that is, 
the extended Church’s thesis), then the halting problem for S' is algorith- 
mically solvable if and only if the function Hg- is Turing-computable, or 
equivalently, by Corollary 5.9, recursive. When the function//^ is recursive, 
we say that the halting problem for S' is recursively solvable. If Hg- is not 
recursive, we say that the halting problem for S' is recursively unsolvable. 


PROPOSITION 5.14 

There is a Turing machine with a recursively unsolvable halting problem. 


Proof 

By Proposition 5.2, let S be a Turing machine that ST-computes the partial 
recursive function nyT\(x,x,y). Remember that, by the definition of stan- 
dard Turing computability, if S' is begun on the tape consisting of only x 
with its reading head scanning the leftmost |, then S' stops if and only if 
fiyTi(x,x,y) is defined. Assume that S' has a recursively solvable halting 
problem, that is, that the function Hg- is recursive. Recall that the Godel 
number of the tape description q$x is 2 9 * Num(x). Now, 

(3>')7i (.v,x, >') if and only if yyT\(x,x,y) is defined 

if and only if S’, begun on qox, performs a computation 
if and only if Hg-( 2 9 * Num(x)) = 0 

Since Hg-, Num and * are recursive, (3>’)Zi(x,x,j-) is recursive, contradicting 
Corollary 5.13(a) (when n = 1). 

Exercises 

5.26 Give an example of a Turing machine with a recursively solvable 
halting problem. 

5.27 Show that the following special halting problem is recursively un- 
solvable: given a Godel number z of a Turing machine S' and a natural 
number x, determine whether S' is applicable to q§x. 

5.28 Show that the following self-halting problem is recursively unsolvable: 
given a Godel number z of a Turing machine S', determine whether S' is 
applicable to q$z. 
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5.29 The printing problem for a Turing machine S' and a symbol a k is the 
problem of determining, for any given tape description a, whether S', begun 
on a, ever prints the symbol a k . Find a Turing machine S' and a symbol a k 
for which the printing problem is recursively unsolvable. 

5.30 Show that the following decision problem is recursively unsolvable: 
given any Turing machine S', if S' is begun on an empty tape, determine 
whether S' stops (that is, whether S' is applicable to qoB). 

5.31 d Show that the problem of deciding, for any given Turing machine, 
whether it has a recursively unsolvable halting problem is itself recursively 
unsolvable. 

To deal with more intricate decision problems and other aspects of the 
theory of computability, we need more powerful tools. First of all, let us 
introduce the notation 

<?"(*!, • • • > * 1 ! ) = U(pyT n (z,3 Cl, . . . ,*„,>’)) 

Thus, by Corollary 5.1 1, (p$, (p } { , <p%, ■ . • is an enumeration of all partial re- 
cursive functions of n variables. The subscript j is called an index of the 
function (p’j . Each partial recursive function of n variables has infinitely 
many indices. 

PROPOSITION 5.15 (ITERATION THEOREM 
OR s-m-n THEOREM) 

For any positive integers m and n, there is a primitive recursive function 
s } ”(z,y u ...,y m ) such that 

Thus, not only does assigning particular values to z,yi,...,y,„ in 
(p” t+n (x i, . . . ,x„,yi, . . . ,y m ) yield a new partial recursive function of n vari- 
ables, but also the index of the resulting function is a primitive recursive 
function of the old index z and of y\ , . . . , y m . 

Proof 

If S' is a Turing machine with Godel number z, let S ' be a Turing 

machine that, when begun on (xi, . . . ,x„), produces (xi, ... ,x„,yi, ... 
moves back to the leftmost | of xf, and then behaves like S'. Such a machine 
is defined by the diagram 

Rr(air) JI+1 r(air)^‘ l " 1 r . . . r(airy’ n+l SrS 

The Godel number ^'(z,yj , . . . ,y m ) of this Turing machine can be effectively 
computed and, by Church’s thesis, would be partial recursive. In fact, d’f can 
be computed by a primitive recursive function g(z,yi , . . . ,y m ) defined in the 
following manner. Let t = y\ + . . . + y m + 2m + 1. Also, let u(i) = 
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2 9+4/ 3 7 5 n 7 9+4« and v (i) = 2 9+4i 3 ll 5 3 7 13+4f . Notice that u(i) is the Godel 
number of the quadruple q t B\qi and v(i) is the Godel number of the quad- 
ruple qt\Rq i+ i. Then take g(z,yi ,. . . ,y m ) to be: 


,2 9 3 "5 3 7 9 i 2 9 3 7 5 3 7 13 -72 , 3 3 7 5 7 7 ' 7 


y 1+2 


II Pm -*\ P \*-1\ * 2 


2944(yj 43) 3 7 +4(yj +4) 


i —2 

>1 -i'J'2+4 

n u(i) v(i) 

^2|i— {yi+ 4 )\P 2 \i— (yi+4)[+l 

(=yi +4 

n 2 9 + 4 (yj 4>2 + 5 ) 3 7 5 3 T^+^O'l +J2+ 6 ) 

2 - * ... * 

2 9 + 4 0i +— +>m— 1 +2nt-l) ^7 53 y9-h4(,vj ]+2m) 

Z “ * 

yi + 2 m 

n u(i) v(i) 

Pi\i - O'! +. . .+v m _i +2 m) \P 2 \i- {y\+... +y m - 1 +2m ) | + 1 * 

i-yi+...+y„,-l+ 2 m 

2 9+4r ^^ ^5^9+4/ 2 9 + 4f '3 7 55 < 7 9 +4(r+l) _2 9 + 4 (*+03ll 5579+4/ 

ia W « 

_2 9 h ('+ | )' 1 7 5 3 7 <; i4 ('+ ;> ) j ! 2 9 +‘ , ( ,+ 2 )3 7 5 3 7® + ‘ , ('+ 3 ) 

7 " 11 " * 


«(«(*)) 


n p‘< 


2((^)i>0^' 4 ( r “ h3 > 3((->r )i 5(( x )i)2 7U 7 )/)3+ 4 ( /+3 ) 


1=0 


g is primitive recursive by the results of Section 3.3. When z is not a Godel 
number of a Turing machine, (p” 1+,> is the empty function and, therefore, 
s*(z,yi, . . . ,+„) must be an index of the empty function and can be taken to 
be 0. Thus, we define: 

j »( 7 Vl v \ _ / g ( z , yi -- ■ ■ ■>’»,) if tm ( z ) 

W,yt, ■■■,)„) - S 0 otherwise 


Hence, s'” is primitive recursive. 


COROLLARY 5.16 

For any partial recursive function f(x 1, . . . there is a recur- 

sive function g(yu ■ . ■ ,y m ) such that 

T(x'i, . . . . . . ,y tn ) *Py(yi ,... ■ • • >^«) 

Proof 

Let e be an index of /. By Proposition 5. 1 5, 

(p e (-* I ) • • ■ )^7J)Tl ) • • • J J/h) ( /+ r: ((\Vl (‘*'1 » • ‘ • )■*■«) 

Let g(yu . . . ,y m ) = 




332 



Examples 

1. Let G(x) be a fixed partial recursive function with non-empty domain. 
Consider the following decision problem: for any u, determine whether 
a) 1 = G. Let us show that this problem is recursively unsolvable, that is 
that the property R(u), defined by (p' u = G, is not recursive. Assume, for 
the sake of contradiction, that R is recursive. Consider the function 
f(x,u) = G{x)-N(Z{p.yT\{u,u,y))). (Recall that N(Z(t)) = 1 for all t). 
Applying Corollary 5.16 to /(*, u), we obtain a recursive function g(iA 
such that f(x,u ) = <pl(u) (*) • For a ny fixed u, (p l g(tl) = G if and only if 
(3y)7i(«, u,y). (Here, we use the fact that G has non-empty domain.) 
Hence, (3y)Ti(u, u,y) if and only if R(g(u)). Since R(g(u)) is recursive, 
(3y)Ti(u,u,y) would be recursive, contradicting Corollary 5.13(a). 

2. A universal Turing machine. Let the partial recursive function 
U(fiyTi(z,x,y)) be computed by a Turing machine i r with Godel number 
e. Thus, U(p.yTi(z,x,y)) = U(pyT 2 (e,z,x,y)). i r is universal in the fol- 
lowing sense. First, it can compute every partial recursive function / (*) of 
one variable. If z is a Godel numbe r of a Turing machine that computes/, 
then, if i r begins on the tape (z,x), it will compute U(pyT\ (z,x,y)) = f(x). 
Further, i r can be used to compute any partial recursive function 
h(x i, . ..,*„). Let vo be a Godel number of a Turing machine that com- 
putes h, and let f(x) = h((x) 0 , (x), , . . . , (x) n _, ). Then h(x u ...,x tl ) 
= f(Po ■ ■ ' Pit— l ) ■ fiy appl^ang Corollary 5.16 to the partial recursive 
function U (gy T„ (v, (x) 0 , (x) p . . . , (x) n _i ,>’))> we obtain a recursive func- 
tion g(v) such that U(py T„(v, (x) 0 , (x) 1? . . . , (x)„_ 1; y)) = <p\ v) (x). Hence, 

f(x) = (p 1 ,, (x). So h(x i, . . . ,x„) is computed by applying i r to the tape 

(gM,Pq 


Exercises 

5.32 Find a superuniversal Turing machine such that, for any Turing 
machine Of, if z is a Godel number of £T and x is the G odel n umber of an initial 
tape description a of HT, then i r * is applicable to qo(z,x) if and only if AT is 
applicable to a; moreover, if^", when applied to a, ends with a tape description 
that has Godel number w, then i r * , when applied to qo(z,x), produces w. 

5.33 Show that the following decision problem is recursively unsolvable: for 
any u and v, determine whether (pi = (p\. 

5.34 Show that the following decision problem is recursively unsolvable: for 
any u, determine whether (p] t has empty domain. (Hence, the condition in 
Example 1 above, that G(x) has non-empty domain is unnecessary). 

5.35 

(a) Prove that there is a recursive function g(u, v) such that 

(Pl(u,v ) (*) = rf(*) ■ vl( x ) 

(b) Prove that there is a recursive function C(w, v) such that 
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vM(x)) 


5.4 THE KLEENE-MOSTOWSKI HIERARCHY. 

RECURSIVELY ENUMERABLE SETS 

Consider the following array, where R(x 1} . . . ,x„,yi, . . . ,y m ) is a recursive 
relation. 

(3yi )R(xi , . . . , X/j , y\ 1 (Vyi ‘ i jri ) 

(Byi)(Vy 2 )R(xi , • • • (Vyi)(3^)/?(xi, . . .,x„,y h y 2 ) 

(3yi)(Vy 2 )(3y 3 )R(x l , - . . ,-v„,yi,y2,J’3) (Vyi)(3y 2 )(Vy 3 )/?(xi, . . . ,x„,yi,y 2 ,y 3 ) 


Let ]Eq = IIq = the set of all 77 -place recursive relations. For k > 0, let Eit 
be the set of all 77 -place relations expressible in the prenex form 

(3jFi)(Vy 2 ) • • • (Qyk)R(* 1 , • • • ,x n ,yi, . . . , 3 *), consisting of k alternating quan- 
tifiers beginning with an existential quantifier and followed by a recursive 
relation R. (Here, \Qyk)' denotes (3>fr) or (Vj^), depending on whether k is 
odd or even.) Let IT” be the set of all 77 -place relations expressible in the 
prenex form (Vyi)(3y 2 ) - • • {Qyk)R{x \, . . . ,x w ,Ti, • • • ,yk), consisting of k al- 
ternating quantifiers beginning with a universal quantifier and followed by a 
recursive relation R. Then the array above can be written 

Eo 

u re 

e" re 

e" re 


This array of classes of relations is called the Kleene-Mo stow ski hierarchy , 
or the arithmetical hierarchy. 


PROPOSITION 5.17 


(a) Every relation expressible in any form listed above is expressible in all 
the forms in lower rows; that is, for all j > k. 


EleBnir 



(b) There is a relation of each form, except Eo> that is not expressible in 
the other form in the same row and, hence, by part (a), not in any of the 
rows above; that is, for k > 0, 






z:-ni^ «* 


(c) Every arithmetical relation is expressible in at least one of these 
forms. 

(d) (Post) For any relation 0(xj , • • • , x„), 0 is recursive if and only if both O 
and ->Q are expressible in the form (3ji )R(x x , . . . ,x„,>’i), where R is 
recursive; that is, FI II" = 

(e) If 0, e YJk and 02 e YH> then 01 v 02 and Q\ A 02 are in If 

0i G n" and 02 G nj*, then Q x V 0 2 and 0! A 02 are in II£. 

(f) In contradistinction to part (d), if k > 0, then 

( z;> ni + , ) - (2> n: ) *0 


Proof 

(a) (3z\ ) (Vyi ) . . . (3zjt) (Vyk)R(x i , . . . , x„ , zi , y x , . . . , z k , y k ) & 

(Vi/)(3zi)(Vyi) . . . (3zjt)(Vj%)(R(xi, . . . ,x„,zi, >>i, . . .,z k ,yy) A u = u) <=$■ 

(3ri)(Vyi) . . . (32jt)(Vnt)(3i/)(/?(x 1 , . . . ,x„,z u y x ,. . . ,z k ,y k ) Ak = u) 

Hence, any relation expressible in one of the forms in the array is 
expressible in both forms in any lower row. 

(b) Let us consider a typical case, say ^3. Take the relation 
(3i;)(Vz)(3;y)T H+ 2(*i ,xi ,*2, ■ • • >x„, v,z,y), which is in ^3. Assume that 
this is in 11", that is, it is expressible in the form (Vr)(3z)(V>’) 
R(x 1, . . . ,x„, v,z, y), where R is recursive. By Exercise 5.19, this relation 
is equivalent to (Vi;)(3z)(Vy)-ir„ + 2(e,xi, . . . ,x„, v,z,y ) for some e. When 
x x — - e, this yields a contradiction. 

(c) Every wf of the first-order theory S can be put into prenex normal form. 
Then, it suffices to note that (3u)(3v)R(u, v) is equivalent to (3z)/?((z) 0 , 
(z)j), and (Vw)(Vr)i?(«, v) is equivalent to (Vz)P((z) 0 , (z) t ). Hence, 
successive quantifiers of the same kind can be condensed into one such 
quantifier. 

(d) If Q is recursive, so is ->0, and, if P(x x ,x„) is recursive, then 

P(xi , . . . ,x„) (3y)(P(x\, . . . ,x n ) Ay = y). Conversely, assume 0 is 

expressible as (3_y)Pi(xi, . . . ,x n ,y) and -1 0 as (3_y)i?2(^i, • • - ,x, n >>), 
where the relations R x and R 2 are recursive. Hence, (Vxi ) . . . 
(Vx w ) (3^) (Ri (xi, . . . ,x n ,y) V R 2 (x 1, . . . ,x„,>>)). So, iff (xi, . . . ,x„) = 
py(R] (xi, . . . ,x„,y) V /?2(xi, . . . ,x„,}>)) is recursive. Then, 0(xi , . . . ,x„) 
<=> R\ (xi, . . . ,x„, «//(xi, . . . ,x„)) and, therefore, Q is recursive. 

(e) Use the following facts. If x is not free in &/. 

h (3x)(,c/ V S8) <*=> (x/ V (3x)^), h (3x)(x/ A&) <=k (x/ A (3x)^?), 
h (Vx) (s/ V SS) 0/ V (Vx)^), h ( Vx) (s/ A 39) ^ {.d a (Vx)^) 
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-(f) We shall suggest a proof in the case n = 1; the other cases are then easy 
consequences. Let G(*) £ ~ IXt* Define P(x) as (3z) [(x = 2zA 

Q(z))V (x = 2z + 1 A -i Q(z))]. It is easy to prove that P ^ u and 
that P E Observe that P(x) holds if and only if 

(3z)(x — 2zl\ Q(z)) V ((3z) z<T (x = 2z + 1) A (Vz) (x -= 2z + 1 ~>Q(z))) 

Hence, P E Il{: +1 (Rogers, 1959). 

Exercises 

5.36 For any relation W of n variables, prove that W E Ya if and only if 
W E ni, where W is the complement of W with respect to the set of all 
n-tuples of natural numbers. 

5.37 For each k 0, find a universal relation in 'y 3 - , that is, for any 

relation W of n variables: (a) if W E Ya-> ti ierl there exists zo such that, for all 
* 1 , . - . , x„, W(x i, . . . ,x„) if and only if J^(zo,x h . . . ,x„); and (b) if W E Yll’ 
there exists Vq such that, for all jq, . . . ,x„, W(x \, . . . ,x n ) if and only if 
-'Vk{vQ-,x \, . . . ,x„). [Hint: Use Exercise 5.19.] 

The s-m-n theorem (Proposition 5.15) enables us to prove the following 
basic result of recursion theory. 

PROPOSITION 5.18 (RECURSION THEOREM) 

If n > 1 and f(x \, . . . ,x„) is a partial recursive function, then there exists a 
natural number e such that 

J (Xl j • • ■ j Xn — 1 , c) - (p e (xi , . . . , Xfi~ 1 ) 

Proof 

Let d be an index of f{x\ , . . . , x„_i, ^j(x rt ,v rt )). Then 

f(x\-, . - - iXff — 1 , S n __ j (Xn , X/j ) ) (Prf(x]j . . . , 

By the s-m-n theorem, 
s* 1 (d,d). Then: 

f(x i , . . . , Xu— i , c) f(xi > - - ■ ) Xh_i , iS n _j (d , rf)) (p d (x"i x n — i , d ) 

1 l) W (xi i • • ■ , X/j. ] ) 

COROLLARY 5.19 (FIXED-POINT THEOREM) 

If h(x) is recursive, then there exists e such that (p\ — (p)^ e y 


(p n d {x\ , . . . , x„) = <p” t * ^ (vi, . . . , x„_i) . Let e = 




COMPUTABILITY 


Proof 

Applying the recursion theorem to /(x,w) — <p\( u \{x), we obtain number e 
such that /(x, e) = (p\ (x) . But /(x, e) = <P/,( e) (x). 

COROLLARY 5.20 (RICE’S THEOREM) (RICE, 1953) 

Let be a set consisting of at least one, but not all, partial recursive 
functions of one variable. Then the set A — {u\(p ] tl E iF} is not recursive. 

Proof 

By hypothesis, there exist numbers u\ and such that U\ E A and ^ A. 
Now assume that A is recursive. Define 

h(r\ = I Ul if X ^ ^ 

W (i / 2 if X £ yl 

Clearly, /j(x) e ^ if and only if x 0 A h is recursive, by Proposition 3.19. By 
the fixed-point theorem, there is a number e such that <p ] e = (pL e y Then we 
obtain a contradiction as follows: 

e £ A if and only if (pi £ 2? 

if and only if cp)^ £ g 7 

if and only if h(e) £ A 
if and only if e £ A 

Rice’s theorem can be used to show the recursive unsolvability of various 
decision problems. 

Example 

Consider the following decision problem: for any u, determine whether (p\ t 
has an infinite domain. Let be the set of all partial recursive functions of 
one variable that have infinite domain. By Rice’s theorem, { u j (p\ t E iF} is 
not recursive. Hence, the problem is recursively undecidable. 

Notice that Example 1 on page 332 and Exercise 5.34 can be handled in 
the same way. 

Exercises 

5.38 Show that the following decision problems are recursively unsolvable. 

(a) For any u, determine whether (p ] u has infinite range. 

(b) For any u, determine whether is a constant function. 

(c) For any u, determine whether q>\ t is recursive. 

" 7 -. 
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5.39 

(a) Show that there is a number e such that the domain of <p\ is {<?}. 

(b) Show that there is a number e such that the domain of (p\ is eo-{e}. 

5.40 This exercise will show the existence of a recursive function that is not 
primitive recursive. 

(a) Let [y/x\ be the largest integer less than or equal to \/x. Show that [y/x\ 
is defined by the recursion 

k(0) = 0 

k(x + 1) = k(x) + sg|(x + 1) — (k(x) + 1) 2 | 

Hence, [yG] is primitive recursive. 

(b) The function Quadrem(.x) = x~ [V*] 2 is primitive recursive and rep- 
resents the difference between x and the largest square less than or equal 
to x. 

(c) Let p(x, y) = ((* +j>) + y) +x, p](z) = Quadrem(z), and p 2 (z) = 
Quadrem([\/z]). These functions are primitive recursive. Prove the 
following: 

(i) pi (/>(*, >0) = x and p^ipfay)) = y. 

(ii) p is a one-one function from co 2 into co. 

(iii) Pi(0) = p 2 (°) = 0 and 

Py(x + 1) = Pi(x) + 1 

Pl{x+ 1) = P 2 (*) 

(iv) Let p 2 denote p, and, for n > 3, define p"( x\,...,x„) 

= p(p t, ~ 1 (xi, . . . ,x„-i),x n ). Then each p n is primitive recursive. 
Define p'l(x) — p”~ ! (pi(x)) for — 1, and p"( x ) — p 2 ( x )- 

Then each p",l is primitive recursive, and 

p'j(p n (x i, . . . , x n ) ) = Xj. Hence, p n is a one-one function of of into 
co. The p n s and tlie p’j s are obtained from p, p\ and p 2 by sub- 
stitution. 

(d) The recursion rule (V) (p. 174) can be limited to the form 

f{xi j ■ • • > Xn -|- [ , 0) (rt ^ 0) 

f(x\ , • . . , , y t 1 ) , . . • , F(x\ , . . . , ,y)) 

[Hint: Given 

f\X 1 , ■ • - ,X‘n, 0) ()(x 1 , . . ■ ,X‘n) 

f(xij . . . jX nj y T 1) h(x i, . . . i Xfj^y i f{xy^ - . . >x’n,v)) 

define F as above, letting G(xi, . . . ,x n+ i,y,z) = h(x\ , . . . ,x n ,y,z). Then 
f (x \ , . . . , Xfi^y} F(x \ , . . . , Xu , q(x\ , . . . , v n ) ■] 

(e) Taking x + y,x ■ y, and [\fx] as additional initial functions, we can limit 
the recursion rule to the one-parameter form: 


J if Pi(* + 1)#0 
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F( x, 0) = G(x) 

F(x,y+1) =H(x,y,F(x,y)) 

[Hint: Let n > 2. Given 

f(x\ 5 • • • j j 0 ) 0 (x*l , ■ - * , X//) 

f(xi » • • ■ i x n j y T 1) h(x x j • • - > Xn , yif(x \ , . . . , x 7I , y)) 
let F(u,y) ~ f(pi(u ), . . . , Define F by a permissible recursion. 

(Note that <5(x), x — y, p n and p" are available.) /(xj, . . . ,x n ,y) 
= F(p n (x 1 ,...,x„),y).] 

(f) Taking x +y, x • y, and [^/x] as additional initial functions, we can use 
h(y,F(x,y )) instead of H(x,y,F(x,y)) in part (e). 

[Hint: Given 

F(x, 0) = G(x) 

F(x,y + \)=H(x,y,F(x,y)) 

let Fi(x,y) = p(x,F(x,y)). Then * = Pi(Fi(x,y)) and F(x,y) 
= p 2 (F\ (x, y) ) . Define Fi (x, y) by a permissible recursion.] 

(g) Taking x + y\ x • y, and [y/x ] as additional initial functions, we can limit 
uses of the recursion rule to the form 

f(x, 0 ) — x 

f(x,y+ 1 ) =h(y,f(x,y)) 


Hint: Given 


F(x, 0) - G(x) 

F(x,y+ \) = h(y,F(x,y)) 

define / as above. Then/(x,y) =f(G(x),y). 

(h) Taking x + y, x • y, [\/x] and x—y as additional initial functions, we can 
limit uses of the recursion rule to those of the form 

0 (O) = 0 

g(y+ 1 )=H(y, g(y)) 

[Hint: First note that |x — y\ — (x—y) + (y— x) and that [\fx] is defin- 
able by a suitable recursion. Now, given 

f(x, 0) =x 

f(x,y+ 1) = h(y,f(x,y)) 


let g(x) = f(p 2 (x) y pi (x)). Then 
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0(O)=/(P 2 (O),Pi(O))-/(O,O)=O 

g(x -f 1) = f(p 2 {x + 1), Pi (x + 1)) 

__ f P 2 (x + 1) if Pi(* + 1) - o 

l h{Pi(x + 1) — l,f(p 2 (x + l)i Pi( x + 1) ~ 1)) if Pi(* + 1) 7^ 0 

f P 2 (x +1) if pi(x + 1) = 0 

l h(Pi(x),f(Pi(x), P 2 (x))) if Pi(x + 1) ^ 0 

= fp 2 (x+l) if Pi( x + 1) = 0 

l HPi(x),g(x)) if pi (x + 1) 7^ 0 

= Pi(x + 1) ■ sg(pi(x + 1)) + /?(pi(x),p(x)) ■ sg (p, (x + 1)) 

= H{x, g(x)) 

Then f(x,y) — g(p(y,x)). (Note that sg is obtainable by a recursion of 
the appropriate form and sg(x) = 1 — x.) 

(i) In part (h), H(y , g(y)) can be replaced by H(g(y)). 

[Hint: Given 

0(0) = 0 

0(y+ 1 )=H{y,g(y)) 

let f(u) = p(u,g(u)) and <p(w) = p(pi(w) + 1 ,/f(p 1 (w),p 2 (w))). Then 

m = 0 

/0'+0 = H/’00) 

and g(u) = p 2 {f{ u ))- (Note that sg(x) is given by a recursion of the 
specified form.) 

(j) Show that the equations 

0) = x + 1 

^(0,y+ 1) = 1^(1, y) 

\}/(x+l,y+l) = M{x,y + i),y) 

define a number-theoretic function. In addition, prove: 

(I) *l*(x i y)>x. 

(II) ij/(x,y) is monotonic in x, that is, if x<z, then \j/(x,y) < \ p(z,y). 

(III) \j/(x + 1 ,y) <ij/{x,y + 1). 

(TV) i j/(x,y) is monotonic in y, that is, if y < z, then t//(x,y) 

(V) D Use the recursion theorem to show that i p is recursive. [Hint: Use 
Exercise 5.21 to show that there is a partial recursive function g such 
that 0 (x,O,«) —x + l, 0 (O,y+ 1,k) = <pj(l,y), and g(x + l,y + l,w) = 
cp 2 ((pl(x y y + 1 ) , y) . Then use the recursion theorem to find e such that 
<?(x,y, e) = (pl(x,y). By induction, show that g(x,y,e) = i p{x,y).] 

(YI) For every primitive recursive function f(x i, . • . there is some fixed 
m such that 


f{x i, . . . ,x„) < ij/(max(x, , . . . ,x„),m) 
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for allxi, . . . , x„ . [Hint: Prove this first for the initial functions Z,N, U“ 
x+y, x x y, [V5F] and x—y, and then show that it is preserved by 
substitution and the recursion of part (i).] Hence, for every primitive 
recursive function / (x), there is some m such that / (x) < »//(x, m) for all x, 
(VII) Prove that t//(x, x) + 1 is recursive but not primitive recursive. 

For other proofs of the existence of recursive functions that are not 
primitive recursive, see Ackermann (1928), Peter (1935; 1967), and 
R.M. Robinson (1948). 

A set of natural numbers is said to be recursively enumerable (r.e.) if 
and only if it is either empty or the range of a recursive function. If we 
accept Church’s thesis, a non-empty recursively enumerable set is a 
collection of natural numbers generated by some mechanical process 
or effective procedure. 


PROPOSITION 5.21 

(a) A set B is r.e. if and only if x E B is expressible in the form (3y)7?(x,y), 
where R is recursive. (We even can allow R here to be primitive re- 
cursive.) 

(b) B is r.e. if and only if B is either empty or the range of a partial recursive 
function.^ 

(c) B is r.e. if and only if B is the domain of a partial recursive function. 

(d) B is recursive if and only if B and its complement B are r.e.! 

(e) The set K = {x|(3y)Ti(x,x,y)} is r.e. but not recursive. 

Proof 

(a) Assume B is r.e. If B is empty, then x E B (3y) (x 7 ^ x A y ^ y). If B is 
non-empty, then B is the range of a recursive function g. Then 
(3y)(g(y) = x). Conversely, assume x (3y)7?(x,y), where R is 

recursive. If B is empty, then B is r.e. If B is non-empty, then let k be a fixed 
element of B. Define 

flfzl-/* if ^W(z)o. (2)1) 

W_ \Mo if /!((*)„,(*),) 

6 is recursive by Proposition 3.19. Clearly, B is the range of 6. (We can take 
R to be primitive recursive, since, if 7? is recursive, then, by Corollary 5.12(a), 
(3y)/?(x,y) (3_y)7\(e,x,y) for some e, and T\ (<?, x, y) is primitive recursive.) 


! Since the empty function is partial recursive and has the empty set as its range, 
the condition that B is empty can be omitted. 

fg = co — B, where co is the set of natural numbers. 
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(b) Assume B is the range of a partial recursive function g. If B is empty, 
then B is r.e. If B is non-empty, then let k be a fixed element of B. By 
Corollary 5.11, there is a number e such that g(x) = U(gyT\(e,x,y)). Let 

U \k if ATi(e, (z) 0 ,(z),) 

By Proposition 3.19, h is primitive recursive. Clearly, B is the range of h. 
Hence, B is r.e. 

(c) Assume B is r.e. If B is empty, then B is the domain of the partial 
recursive function fiy(x + y + 1 = 0). If B is non-empty, then B is the range 
of a recursive function g. Let G be the partial recursive function such that 
G'(y) = f.Lx(g(x) = y). Then B is the domain of G. Conversely, assume B is 
the domain of a partial recursive function H. Then there is a number e such 
that H(x) = U(fxyTi(e,x,y )). Hence, H(x) = z if and only if (3y)(7i (e,x,y)A 
U(y) = z). But, x E B if and only if (3z) (H(x) — z). So, x e B if and only if 
(3z)(3y)(7\(e,x,y) A U{y) — z), and the latter is equivalent to (3w)(7j(e,x, 
(h)j) A [/((m)]) = (h) 0 ). Moreover, 7\(e,jc, (w) j ) A [/((w^) =(«) 0 is recur- 
sive. Thus, by part (a), B is r.e. 

(d) Use part (a) and Proposition 5.17(d). (The intuitive meaning of part 
(d) is the following: if there are mechanical procedures for generating B and 
B, then to determine whether any number n is in B we need only wait until n 
is generated by one of the procedures and then observe which procedure 
produced it.) 

(e) Use parts (a) and (d) and Corollary 5.13(a). 

Remember that the functions <pl(x) — U {fiyT\ (n,x,y)) form an enumer- 
ation of all partial recursive functions of one variable. If we designate the 
domain of (p\ by W„, then Proposition 5.21(c) tells us that Wq , W\, W 2 , ... is 
an enumeration (with repetitions) of all r.e. sets. The number n is called the 
index of the set W„. 


Exercises 

5.41 Prove that a set B is r.e. if and only if it is either empty or the range of 

a primitive recursive function. [Hint: See the proof of Proposition 5.21(b).] 

5.42 

(a) Prove that the inverse image of a r.e. set B under a partial recursive 
function / is r.e. (that is, (x|/(v) e B) is r.e.). 

(b) Prove that the inverse image of a recursive set under a recursive func- 
tion is recursive. 

(c) Prove that the image of a r.e. set under a partial recursive function is 
r.e. 

(d) Using Church’s thesis, give intuitive arguments for the results in parts 
(a)-(c). 
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(e) Show that the image of a recursive set under a recursive function need 
not be recursive. 

5.43 Prove that an infinite set is recursive if and only if it is the range of a 
strictly increasing recursive function, (g is strictly increasing if x < y implies 
g(x) < g(y).) 

5.44 Prove that an infinite set is r.e. if and only if it is the range of a one- 
one recursive function. 

5.45 Prove that every infinite r.e. set contains an infinite recursive subset. 

5.46 Assume that A and B are r.e. sets. 

(a) Prove that AuB is r.e. [In fact, show that there is a recursive function 
g(u, v) such that W g ( UyV ) = W lt U W v .] 

(b) Prove that AnB is r.e. [In fact, show that there is a recursive function 
h(u,v) such that JV/,g ljV ) = W u nW v .] 

(c) Show that A need not be r.e. 

(d) Prove that (J„ e a % * s re * 

5.47 Show that the assertion 

(y) A set B is r.e. if and only if B is effectively enumerable (that is, 
there is a mechanical procedure for generating the numbers in B) 
is equivalent to Church’s thesis. 

5.48 Prove that the set A = {«( W u = co} is not r.e. 

5.49 A set B is called creative if and only if B is r.e. and there is a partial 
recursive function h such that, for any n, if W n C B, then h(n) e B — W„. 

(a) Prove that {x|(3y)7i (x,x,y)} is creative. 

(b) Show that every creative set is non-recursive. 

5.50 d A set B is called simple if B is r.e., B is infinite, and B contains no 
infinite r.e. set. Clearly, every simple set is non-recursive. Show that a simple 
set exists. 

5.51 A recursive permutation is a one-one recursive function from co onto to. 
Sets A and B are called isomorphic (written A ~ B) if there is a recursive 
permutation that maps A onto B. 

(a) Prove that the recursive permutations form a group under the opera- 
tion of composition. 

(b) Prove that ~ is an equivalence relation. 

(c) Prove that, if A is recursive (r.e., creative, simple) and A — B, then B is 
recursive (r.e., creative, simple). 

Myhill (1955) proved that any two creative sets are isomorphic. (See also 
Bernays, 1957.) 

5.52 A is many-one reducible to B (written viR m i?) if there is a recursive 
function / such that u E A if and only if / (u) E B. (Many-one reducibility of 
A to B implies that, if the decision problem for membership in B is recur- 
sively solvable, so is the decision problem for membership in A) A and B are 
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called many-one equivalent (written A = m B ) if ^4R m 5 and 7?R m /4 . A is one- 
one reducible to B (writtten ,4Ri5) if there is a one-one recursive function / 
such that u e A if and only if f(u) e B. A and B are called one-one equivalent 
(written A =1 B) if /IRi# and 5Ri/l. 

(a) Prove that = m and =1 are equivalence relations. 

(b) Prove that, if A is creative, B is r.e., and /lR m i?, then B is creative. 
[Myhill (1955) showed that, if A is creative and B is r.e., then 7?R m /l . ] 

(c) (Myhill, 1955) Prove that, if A\^ X B then /JR m £, and if A =1 B then 
A = m B. However, many-one reducibility does not imply one-one re- 
ducibility, and many-one equivalence does not imply one-one equiv- 
alence. [Hint: Let A be a simple set, C an infinite recursive subset of A, 
and B = A — C. Then /IRii? and 5R mJ 4 but not-(i?Rp4).] It can be 
shown that A =1 B if and only if A ~ B. 

5.53 (Dekker, 1955) A is said to be productive if there is a partial 
recursive function / such that, if W„ C A, then f(n)eA — JV„. Prove the 
following. 

(a) If A is productive, then A is not r.e.; hence, both A and A are infinite. 

(b) D If A is productive, then A has an infinite r.e. subset. Hence, if A is 

productive, A is not simple. 

(c) If A is r.e., then A is creative if and only if A is productive. 

(d) D There exist 2^° productive sets. 

5.54 (Dekker and Myhill, 1960) A is recursively equivalent to B (written 
A ~ B) if there is a one-one partial recursive function that maps A onto B. 

(a) Prove that ~ is an equivalence relation. 

(b) A is said to be immune if A is infinite and A has no infinite r.e. subset. A 
is said to be isolated if A is not recursively equivalent to a proper subset 
of A. (The isolated sets may be considered the counterparts of the 
Dedekind-finite sets.) Prove that an infinite set is isolated if and only if 
it is immune. 

(c) D Prove that there exist 2 N ° immune sets. 

Recursively enumerable sets play an important role in logic because, if we 
assume Church’s thesis, the set 7k of Godel numbers of the theorems of any 
axiomatizable first-order theory K is r.e. (The same holds true of arbitrary 
formal axiomatic systems.) In fact, the relation (see page 198) 

PfK(y,x): y is the Godel number of a proof in K of a wf with Godel 
number x 

is recursive if the set of Godel numbers of the axioms is recursive, that is, if 
there is a decision procedure for axiomhood and Church’s thesis holds. 
Now, x E 7k if and only if (3y)PfK(y, x) and, therefore, 7k is r.e. Thus, if we 
accept Church’s thesis, K is decidable if and only if the r.e. set 7k is re- 
cursive. It was shown in Corollary 3.46 that every consistent extension K of 
the theory RR is recursively undecidable, that is, 7k is not recursive. 
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Much more general results along these lines can be proved (see Smullyan 
1961; Feferman, 1957; Putnam, 1957; Ehrenfeucht and Feferman, 1960; and 
Myhill, 1955). For example, if K is a first-order theory with equality in the 
language a of arithmetic: (1) if every recursive set is expressible in K, then 
K is essentially recursively undecidable, that is, for every consistent exten- 
sion K' of IC, Tk' is not recursive (see Exercise 5.58); (2) if every recursive 
function is representable in K and 1C satisfies conditions 4 and 5 on page 
208, then the set 7k is creative. For further study of r.e. sets, see Post (1944) 
and Rogers (1967); for the relationship between logic and recursion theory, 
see Yasuhara (1971) and Monk (1976, part III). 


Exercises 

5.55 Let K be a first-order theory with equality in the language 3? A of 
arithmetic. A number-theoretic relation B(x i,...,x„) is said to be weakly 
expressible in K if there is a wf 3d(x\ ... . ,x„) of K such that, for any natural 
numbers k \, . . . ,k m B(ki, . . . ,k n ) if and only if h K 33 (k \ , . . . ,k„). 

(a) Show that, if K is consistent, then every relation expressible in K is 
weakly expressible in K. 

(b) Prove that, if every recursive relation is expressible in IC and IC is oo- 
consistent, every r.e. set is weakly expressible in IC. (Recall that, when 
we refer here to a r.e. set B, we mean the corresponding relation 
‘x e B\) 

(c) If K has a recursive vocabulary and a recursive axiom set, prove that 
any set that is weakly expressible in IC is r.e. 

(d) If formal number theory S is co-consistent, prove that a set B is r.e. if 
and only if B is weakly expressible in S. 

5.56 

(a) (Craig, 1 953) Let K be a first-order theory such that the set 7k of Godel 
numbers of theorems of K is r.e. Show that K is recursively axiom- 
atizable. 

(b) For any wf 33 of formal number theory S, let 33# represent its trans- 
lation into axiomatic set theory NBG (see page 269. Prove that the set 
of wfs 38 such that Fnbg 33# is a (proper) recursively axiomatizable 
extension of S. (However, no ‘natural’ set of axioms for this theory is 
known.) 

5.57 Given a set A of natural numbers, let u 6 A* if and only if u is a Godel 
number of a wf 33{x{) and the Godel number of $(Ti) is in A Prove that, if A 
is recursive, then vl* is recursive. 

5.58 Let IC be a consistent theory in the language 3£ a of arithmetic. 

(a) Prove that (7 k)* is not weakly expressible in IC. 

(b) If every recursive set is weakly expressible in K, show that K is re- 
cursively undecidable. 
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(c) If every recursive set is expressible in K, prove that IC is essentially 
recursively undecidable. 

5.5 OTHER NOTIONS OF COMPUTABILITY 


Computability has been treated here in terms of Turing machines because 
Turing’s definition is probably the one that makes clearest the equivalence 
between the precise mathematical concept and the intuitive no lion. 1 We 
already have encountered other equivalent notions: standard Turing com- 
putability and partial recursiveness. One of the strongest arguments for the 
rightness of Turing’s definition is that all of the many definitions that have 
been proposed have turned out to be equivalent. We shall present several of 
these other definitions. 


Herbrand-Godel Computability 

The idea of defining computable functions in terms of fairly simple systems 
of equations was proposed by Herbrand, given a more precise form by 
Godel (1934), and developed in detail by Kleene (1936a). The exposition 
given here is a version of the presentation in Kleene (1952, chap. XI.) 

First let us define the terms. 

1. All variables are terms. 

2. 0 is a term. 

3. If Ms a term, then ( t )' is a term. 

4. If ti, . . . , t„ are terms and f" is a function letter, then f?{t\ ,...,?„) is a 
term. 

For every natural number n, we define the corresponding numeral n as 
follows: (1) 0 is 0 and (2) n + 1 is («|. Thus, every numeral is a term. 

An equation is a formula r ~ s where r and s are terms. A system E of 
equations is a finite sequence r\ = S ] , i ‘2 = 52 , ...,/> = s k of equations such 
that r k is of the form fHt \, . . . , t„). 

The function letter JJ is called the principal letter of the system E. Those 
function letters (if any) that appear only on the right-hand side of equations 
of E are called the initial letters of E; any function letter other than the 
principal letter that appears on the left-hand side of some equations and also 
on the right-hand side of some equations is called an auxiliary letter of E. 
We have two rules of inference: 


*For further justification of this equivalence, see Turing (1936-37), Kleene 
(1952, pp. 317-323, 376-381) and Mendelson (1990). 
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Ri: An equation e 2 is a consequence of an equation e\ by R| if and only if 
e 2 arises from e\ by substituting any numeral n for all occurrences of a 
variable. 

R 2 : An equation e is a consequence by R 2 of equations //"(Hi, . . . , h m ) =p 
and r = s if and only if e arises from r — s by replacing one or more 
occurrences of ///'(Hi , . . . , n m ) in .s' by p, and r = s contains no variables. 

A proof of an equation e from a set B of equations is a sequence eo,...,e„ 
of equations such that e n is e and, if then: (1) e t is an equation of B, 

or (2) e,- is a consequence by R\ of a preceding equation ej(j < /), or (3) e,- is a 
consequence by R 2 of two preceding equations ej and e m (J < /, m < ?'). We 
use the notation B h e to state that there is a proof from B of e (or, in other 
words, that e is derivable from B). 

Example 

Let E be the system 

fl (* 1 ) = (* 1 )' 

fl(Xl,X 2 ) = fy(2,X2,fi(xi)) 

The principal letter of E is /j 2 , // is an auxiliary letter, and /j 3 is an initial 
letter. The sequence of equations 

fi (^'1 ,x 2 )= fi (2, x 2 ,// {xi )) 
fi( %x 2 ) = fl (2 , x 2 , fl (2) ) 

flan) —fi(i, i,fi(i)) 
fKxi) = an 

f}( 2) = (2)' (Le.,/ 1 1 (2) = 3) 

flan) — fl ( 2 > t, 3 ) 

is a proof of fl( 2, 1) = / 3 ( 2, 1,3) from E. 

A number-theoretic partial function cp(x 1 , . . . ,x„) is said to be computed 
by a system E of equations if and only if the principal letter of E is a letter fV 
and, for any natural numbers /q , . . . , k, u p, 

E \~ ff(ki,. . . ,k») —P if and only if <p(£i, . . . , k n ) = p 

The function cp is called Herbrand-Godel-computcible (for short, HG-com- 
putable) if and only if there is a system E of equations by which (p is 
computed. 

Examples 

1. Let E be the system fl(x 1 ) = 0. Then E computes the zero function Z. 
Hence, Z is HG-computable. 

2. Let E be the system fl{x\) = (x*i) / . Then E computes the successor 
function N. Hence, N is HG-computable. 
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3. Let E be the system fP(x 1, . . . , x „ ) = Xj. Then E computes the projection 
function Uf. Hence, U” is HG-computable. 

4. Let E be the system 

/^(xi,0) -*1 

flixu fa)') = Uifruxi))' 


Then E computes the addition function. 

5. Let E be the system 

y?(*i) = () 

=X\ 

The function (p(xi) computed by E is the partial function with domain {0} 
such that <p(0) = 0. For every k ^ 0, E h f\ (k) = 0 and E I- // (k) = k. 
Hence, <p(*i) is not defined for xi 7^ 0. 


Exercises 


(a) What functions are HG-computable by the following systems of 
equations ? 

(i) fl (0) — 0, fl{(xij)=x x 

(ii) /i 2 (^i,0) =x h /?(0,* 2 ) = 0, fHM’fa)') =Ji(xuX2) 

(iii) fi(xi) = 0, f}(x i) = 0' 

(iv) f\(x\ j 0) = *1, f?(x l,(x 2 )') = (ft(x l,X2))', f}(x 1) =/i 2 (xi,A'i) 


(b) 

5.60 

(a) 

(b) 

(c) 


Show that the following functions are HG-computable. 

(i) |*i -X2I 

(ii) *1 ■ *2 

/-■■>. / \ f 0 if * is even 

(m) (p{x) = , . , , 

11 if * is odd 

Find a system E of equations that computes the /(-place function that is 
nowhere defined. 

Let / be an //-place function defined on a finite domain. Find a system 
of equations that computes /. 

If f(x) is an HG-computable total function and g(x) is a partial func- 
tion that coincides with f(x) except on a finite set A, where g is unde- 
fined, find a system of equations that computes g. 


PROPOSITION 5.22 


Every partial recursive function is HG-computable. 



348 


COMPUTABILITY 


Proof 

(a) Examples 1 -3 above show that the initial functions 2, N and U” are HG- 
computable. 

(b) {Substitution rule (IV).) Let (p{x i,...,x„) = 'nWh (m, ••• ,x„), ... , 

.,x„)) where rj, \j/ u have been shown to be HG-computable. 

Let E/ be a system of equations computing t//,-, with principal letter ff\ and 
let E„ l+ i be a system of equations computing >], with principal letter f m+ \. By 
changing indices we may assume that no two of Ei , . . . , E /n+ i have any 
function letters in common. Construct a system E for q> by listing 
Ei,...,E* + i and then adding the equation ^ +2 (*i, . . . ,x„) = 

{fx{x i, . - - ,x„), . . . ,f”(x i, . . . ,x„)). (We may assume that f” +2 does not oc- 
cur in Ei, . . . ,E m +i.) It is clear that, if cp{ki, . . . ,k„) =p, then E I- fj' 
(k[ k„) =p. Conversely, if E _h J% +2 (k u . . . , k„) = p, then E 
h Jf{k l ,...,k n )=p l ,,.. i Eh f”{ki ,...,k„) = p m _ and E h f;" ±l (p y , . . . ,p m ) 
= p. Hence, it readily follows that Ei h/”(/ci, . . . ,k„) = Pi , . . . , E,„ h 
fm {k \ , • ■ ■ M = Pm and E„ I+] h f ” l+ ] (pi , . . . , p„) = p. Consequently, »//, 
{k[ , . . . , kji^j P\i ■ • • 5 > • • • » Pm and tf{pi ? ■ ■ * j Pm) p. So, 

(p(k \, . . . , k, t ) — p. [Hints as to the details of the proof may be found in 
Kleene (1952, chap. XI, especially, pp. 262-270).] Hence, (p is I IG-com- 
putable. 

(c) {Recursion rule (V).) Let 

(p{x i , • ■ • , Xjij 0) ij/{xi , . . . , Xjj ) 

<p(xj, . . . ,X,,,X n _|_i f 1) 0(xi, - • - <p(xi, . . . ,.A'n+l )) 

where i ft and $ are HG-computable. Assume that Ei is a system of equations 
computing \j/ with principal letter /" and that E2 is a system of equations 
computing i) with principal letter Then form a system for computing <p 
by adding to Ei and E2 

f\ (r 1 j ■ ■ ■ 1 1 0) Ry (xj , . . . , x,;) 

/r 1 hi.-". X n , (x, 1+ i )') = fy +2 (xi , . . . , X„ + l ,/" 1 1 (x, , . . . , x„ +1 )) 

(We assume that Ei and E2 have no function letters in common.) Clearly, if 
(p{ky, . . . , k„,k) =p, then E h fy{ky,...,k„,k)=p. Conversely, one can 
prove easily by induction on k that, if E L f{ ,+1 {k \ , . . . , k n , k) = p , then 
<p{ki , . . . ,/r„, k) = p. Therefore, is HG-computable. (The case when the 
recursion has no parameters is even easier to handle.) 

(d) (^-operator rule (VI).) Let (p{x 1 , . . . ,x„) = py{\j/{xi> ... ,x„,y) = 0) 
and assume that 1 ft is HG-computable by a system Ei of equations with 
principal letter /" +1 . By parts (a)-(c), we know that every primitive recursive 
function is HG-computable. In particular, multiplication is HG-comput- 
able; hence, there is a system E2 of equations having no function letters in 
common with Ei and with principal letter f 2 such that E2 h f 2 {k \ , A'2) =p if 
and only if k\-k 2 — p. We form a system E3 by adding to Ei and E2 the 
equations 
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/”+ 1 (x 1 ,...,A-„, 0 ) = 1 


p (Xi, • • • ,X n , (x‘l, ■ - ■ iXn>Xn+v)if\ (Xl, ■ ■ ■ ,X„X n _j_[)) 


H+l 


One can prove by induction that E 3 computes the function 
"T . . . } x n ,y); that is, E 3 F/" + 1 (fci, . . . ,k n ,k) =p if and only if 

\ y<z ^{k \ , . . . ,k„,y) = p. Now construct the system E by adding to E 3 the 
equations 


/4 3 ((*l)'>°,*3) = *3 

C^l? • ’ ■ jX„) f 4 C/3 (xi) ■ ■ ■ J X„ > X|,-J-l ) ) f'i (xi) * ‘ ■ ,X„, (x,l-fl) ),^H+l) 


Then E computes the function <p(xi , . . . ,x„) = py(\j/(xi, . . . , x„ , y) = 0). If py 
(ij/{ki, . . . ,k„,y) = 0) = q, then E 3 F f"p\k u . , k „ , q) = p, where p + 1 = 
l\ y<q Hku • • • , k„,y), and E 3 F /” +1 (k u ..., k n ,q') = 0. Hence, E^ F 
(ki,.. .,k n ) = fi(p,Q,q)- But, EJ- fl (pj 0,q) — q, and so, E \~f”(k u . . . , 
k„) = q. Conversely, if E f"(k\, . . . ,k„) =q, then E F , 0,</) = q, 
where E 3 F / 3 " 11 (ki,... } k n , q) = (m)' and E 3 F /" +1 (&i , . . . ,£„,<f) = 0. 
Hence, Y\ y<g ij/(ki , . . . , k„,y) = m + 1 ± 0 and Y[ y<q ^{k \ , ■ - • , k n ,y) = 0. So, 
\J/(k { , . . . ,k n ,y) for y < q, and ij/(ki, ... ,k n ,q) = 0. Thus, py(\j/(k \ , 
. . . ,&„,y) — 0) = q. Therefore, (p is HG-computable. 

We now shall proceed to show that every HG-computable function is 
partial recursive by means of an arithmetization of the apparatus of Her- 
brand-Godel computability. We shall use the same arithmetization that was 
used for first-order theories (see Section 3.4). (We take the symbol ' to be an 
abbreviation for ff. Remember that r — s is an abbreviation for Apr, s). The 
only individual constant is 0.) In particular, the following relations and 
functions are primitive recursive (see pages 1 92—4): 


FL(x): jr is the Godel number of a function letter 

(3y) y<x (3z) z<x (x = 1 + 8(2>’ ■ 3 Z ) A y > 0 A 2 > 0) 

EVbl(x): x is the Godel number of an expression consisting of a variable 
EFL(x): x is the Godel number of an expression consisting of a function 
letter 

Nu(x): x is the Godel number of a numeral 
Trm(x): jc is the Godel number of a term 
Num(x) = the Godel number of the numeral x 

Argy(x) = the number of arguments of a function letter, /, if x is the Godel 
number of / 

x*y = the Godel number of an expression AB if x is the Godel number of 
the expression A and y is the Godel number of B 
Subst(x,y, u, v): v is the Godel number of a variable x u is the Godel 
number of a term t, y is the Godel number of an expression and x is 
the Godel number of the result of substituting t for all occurrences of jc, 
in ® 
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The following are also primitive recursive: 

Eqt(x): x is the Godel number of an equation: 

/A(x) = 3 A Trm((x) 1 ) A Trm((x) 2 ) A (x) 0 — 99 

(Remember that= is A\ , whose Godel number is 99.) 

Syst(x): x is the Godel number of a system of equations: 

(VyW^Eqt^,) A FL((((x) //(vKl ) 1 ) 0 ) 

Occ(«, ?;): u is the Godel number of a term t or equation S3 and v is the Godel 
number of a term that occurs in t or S3: 

(Trm(w) V Eqt(z/)) A Trm(i’) A (3v) Y<I( (3y) J)<1( (z/ = x *v * y 
V u — x * v V u = v * y V u = v ) 

Consi (u, v): u is the Godel number of an equation e \ , v is the Godel number 
of an equation e 2 , and e-i is a consequence of e\ by rule Re 

Eqt(z/) A Eqt(u) A (3x) v<a (3y)_,, <i; (Nu(y) A Subst(r, u,y,x) A Occ(z/,x)) 

Cons 2 (w,z, v): u, z, v are Godel numbers of equations e \ , e 2 , e^, respec- 
tively, and es is a consequence of e\ and ei by rule R 2 : 

Eqt(zz) A Eqt(z). A Eqt(r) A ->(3x) v <z (EVbl(x) A Occ(z,x)) 

A FL(((z) 1 ) 0 ) A (Vx) 0<x< ^ (2)i) ^FL(((z),),) 

A (V x ).x c «( (z), ) “'FL ( ( (z) 2 ) T ) A Occ((z/) 2 , (z),) 

A [(3y),<„(3 w) w< «((«)2 = V * Wi a^a^ 2»3«« 5***") V . 

((«) 2 = ( Z ),Ai) = 2"3 ( ») , 5^)] 

Ded(«,z): u is the Godel number of a system of equations E and z is the 
Godel number of a proof from E: 

Syst(zz) A (Vx) v < Ul(z) ( ( 3vv) u , < ///(u) (u) w = (z) v 

v ( 3y)j < .x Cons 1 ( {z ) y , (z) T ) V (3y) Jf <x (3v) v <Jt Cons 2 ((z) J „ (z) v , (z)J) 

S h (m,jci, ... ,x„,z) : u is the Godel number of a system of equations E whose 
principal letter is of the form JJ, and z is the Godel number of a proof 
from E of an equation of the form f?(x 1 , . . . ,x„) =p : 

Ded(«,z) A Arg T (((ii) /if(lf)J . 1 ) 1 ) 0 = n A (((z)^ (2) ^ 1 ) 1 ) 0 
= ((( w V/(«)-m)i)o a (Vv )o < x<^A(( 2 )// ( r)- 1 )i)~ , FL(((( z)/’/( 2 )^- 1 )i ) j') 

A Nu (((z) MW , 1 ) 2 ) A ((z^^i ^ 2«(“W»>>>» * 2 3 * 2 Num ^') * 2 7 
* 2 Num hd * 2 7 * • • • * 2 7 * 2 Num( - Tn) * 2 5 

Remember that g(() = 3, $()) = 5 and g ( , ) = 7. 
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U( x) = p.y y<x (Num(y) = ((x) //? ( v )^i) 2 )- (If -X' is the Godel number of a proof 
of an equation r = p, then U(x ) = p .) 


PROPOSITION 5.23 

(ICleene, 1936a) If (p{x i , ... ,x„) is HG-computable by a system of equations 
E with Godel number e , then 

<p(x i, ■ • ■ ,x„) = U(py(S n (e,xi , . . . , A„,y))) 

Hence, every HG-computable function (p is partial recursive, and, if cp is 
total, then (p is recursive. 


Proof 

cp{k\, . . . ,k„) = p if and only if E b fj{k\, . . . ,k„) =p, where fj is the 
principal letter of E. (p(k\ ,... ,k„) is defined if and only if 
(3y)S„(e, k\ , . . . , k„,y). If (p(k \ , . . . , kn) is defined, /ry(S„(<?, k \ , ._. . , k„,y)) is the 
Godel number of a proof from E of an equation fj(k\,. . . ,k n ) —p. Hence, 
U(fiy(S n (e,ki, . . . ,k, t ,y))) =p = <p(ki , . . . 1 k n ). Also, since S„ is primitive 
recursive, py(S„(e ) x\, . . . ,x„,y)) is partial recursive. If <p is total, then 
(Vxi) . . . (Vx„)(3y)S n (c,vi, . . . , x„,y); hence, ^_y(S H (c,jci, . . . ,x n ,y)) is recur- 
sive, and then, so is U(ny(S„(e, jq, . . . ,*„,y))). 

Thus, the class of HG-computable functions is identical with the class of 
partial recursive functions. This is further evidence for Church’s thesis. 


Markov algorithms 

By an algorithm in an alphabet A we mean a computable function 91 whose 
domain is a subset of the set of words of A and the values of which are also 
words in A. If P is a word in A, 91 is said to be applicable to P if P is in the 
domain of 91; if 91 is applicable to P, we denote its value by 9I(P). By an 
algorithm over an alphabet A we mean an algorithm 91 in an extension B of 
Ad Of course, the notion of algorithm is as hazy as that of computable 
function. 

Most familiar algorithms can be broken down into a few simple steps. 
Starting from this observation and following Markov (1954), we select a 
particularly simple operation, substitution of one word for another, as the 
basic unit from which algorithms are to be constructed. To this end, if P and 
Q are words of an alphabet A, then we call the expressions P — > Q and 
P — > • Q productions in the alphabet A. We assume that d and are not 
symbols of A. Notice that P or Q is permitted to be the empty word. P — > Q 


tAn alphabet B is an extension of A if A C B. 
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is called a simple production, whereas P — ► - Q is a terminal production. Let 
us use P —► (•) Q to denote either P — > Q or P — ► • Q. A finite list of pro- 
ductions in A 


Pi -(■) Qi 

P 2 ->(■) Q 2 

■ 

P r -(*) Q, 

is called an algorithm schema and determines the following algorithm 51 
in A. As a preliminary definition, we say that a word T occurs in a word 
Q if there are words U, V (either one possibly the empty word A) such 
that Q — UTV. Now, given a word P in A: (1) We write 51: Pd if none 
of the words Pi , . . . , P,. occurs in P. (2) Otherwise, if m is the least in- 
teger, with 1 < r, such that P w , occurs in P, and if R is the word that 

results from replacing the leftmost occurrence of P m in P by Q wl , then we 
write 

(a) 91 : P h R 

if P m — > (•) Q m is simple (and we say that 51 simply transforms P into R); 

(b) 91 : P I- • R 

if P„, — > (•) is terminal (and we say that 51 terminally transforms P into 
R). We then define 51 : P f= R to mean that there is a sequence 
Ro, Ri , . . . , R* such that: 

(i) P = Ro- 

(ii) R = R*. 

(iii) For O^j^k — 2, 51 : Ry I- Ry+i. 

(iv) Either 51 : R*_i h Ra or 51 : R*_i • Ra. (In the second case, we write 

51 : P H • R- ) 

We set 5I(P) = R if and only if either 51 : P |= ■ R, or 51 : P f= R and 
51 : R □. The algorithm thus defined is called a normal algorithm (or a 
Markov algorithm) in the alphabet A. 

The action of 51 can be described as follows: given a word P, we find the 
first production P„, — ► (•) Q m in the schema such that P,„ occurs in P. We 
then substitute Q,„ for the leftmost occurrence of P,„ in P. Let Ri be the new 
word obtained in this way. If P,„ — » (•) Q m was a terminal production, the 
process stops and the value of the algorithm is Ri. If P m — ► (•) Q m was 
simple, then we apply the same process to Ri as was just applied to P, and so 
on. If we ever obtain a word R, such that 51 : R,H, then the process stops 
and the value 5I(P) is R,-. It is possible that the process just described never 
stops. In that case, 51 is not applicable to the given word P. 

Our exposition of the theory of normal algorithms will be based on 
Markov (1954). 
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Examples 

1. Let A be the alphabet {b, c}. Consider the schema 

b — > ■ A 
c — *• c 

The normal algorithm 31 defined by this schema transforms any word 
that contains at least one occurrence of b into the word obtained by 
erasing the leftmost occurrence of b. 31 transforms the empty word A into 
itself. 31 is not applicable to any non-empty word that does not contain b. 

2. Let A be the alphabet {ao,ai, . . . ,a„}. Consider the schema 

ao -»• A 
ai -» A 

* 

■ 

■ 

a,, -» A 

We can abbreviate this schema as follows: 

£-A (£ in A) 

(Whenever we use such abbreviations, the productions intended may be 
listed in any order.) The corresponding normal algorithm transforms 
every word into the empty word. For example, 

31 : aia2aia3ao F a^a^ I- a 2 aia 3 h a 2 a 3 h a 3 h A and 31 : An. Hence, 
3I(aia2aia3ao) = A. 

3. Let A be an alphabet containing the symbol ai, which we shall abbre- 
viate |. For natural numbers n, we define n inductively as follows: 0 =| 
and n + 1 = n |. Thus, 1 = ||, 2 =|||, and so on. The words n will be called 
numerals. Now consider the schema A — > ■ |, defining a normal algorithm 
31. For any word P in A, 3I(P) =| P2 In particular, for every natural 
number n, 3I(n) = n + 1. 

4. Let A be an arbitrary alphabet (ao,ai, . . . ,a„}. Given a word 
P = a yr 'djj ■ • • a Jk , let P = a j k • ■ ■ a 7] a j 0 be the inverse of P. We seek a nor- 
mal algorithm 31 such that 3I(P) = P. Consider the following (abbrevi- 
ated) algorithm schema in the alphabet B = Au(o!, p}. 

(a) aa — > p 

(b) P£-£P (£ in A) 

(c) P« -> p 

(d) P — > • A 

(e) ar]£-±t;ar\ (^,iiinA) 

(f) A a 

tTo see this, observe that A occurs at the beginning of any word P, since 
P = AP. 
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This determines a normal algorithm in B. Let P — a /0 a 7l • • • a Jk be any 
word in A. Then 51 : P h aP by production (f). Then, aP h ay, aa /0 a j2 ... a/, 
h a J{ a j 2 cm ^ a y 3 . . . a A . . . Ha ;i a jr .. a A aa J0 , all by production (e). Thus, 
51 : P j= a y -,ay 2 . . . a/*aa J0 . Then, by production (f), 51 : P (= aa,-, a,-, 
a Jk aa jo . Applying, as before, production (e), 51 : P (= a j2 a j3 . . . a jk aa Jt aa j(i 
Iterating this process, we obtain 51 : P f= aa jk aaj kX a . . . aa,-, aa ; - 0 . Then, by 
production (f), 51: P (= ma jk aaj k _ 3 a . . . aa^, oca/ 0 , and, by production (a), 
51 : P (= pay t oca/ t _,a . . . aa y -, aa J0 . Applying productions (b) and (c) and fi- 
nally (d), we arrive at 51 : P {= • P. Thus, 51 is a normal algorithm over A 
that inverts every word of Ad 


Exercises 

5.61 Let A be an alphabet. Describe the action of the normal algorithms 
given by the following schemas. 

(a) Let Q be a fixed word in A and let the algorithm schema be: A — > • Q. 

(b) Let Q be a fixed word in A and let a be a symbol not in A. Let 
B=A u{a}. Consider the schema 

— » Ijx (£ in A) 

a — » • Q 
A — » a 

(c) Let Q be a fixed word in A. Take the schema 

$ - A (5 in A) 

A • Q 

(d) Let B = Au{|}. Consider the schema 

S-l ft in A -{|}) 

A — > • | 

5.62 Let A be an alphabet not containing the symbols a, P, y. Let B — 
Au{a} and C = Au{a, P, y}. 

(a) Construct a normal algorithm 51 in B such that 51(A) — A and 
5l(^P) = P for any symbol ^ in A and any word P in A. Thus, 51 erases 
the first letter of any non-empty word in A. 


^The distinction between a normal algorithm in A and a normal algorithm over 
A is important. A normal algorithm in A uses only symbols of A, whereas a normal 
algorithm over A may use additional symbols not in A. Every normal algorithm in A 
is a normal algorithm over A, but there are algorithms in A that are determined by 
normal algorithms over A but that are not normal algorithms in A (for example, the 
algorithm of Exercise 5.62(d)). 
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(b) Construct a normal algorithm X in B such that £>(A) = A and 
X>(P£) — P for any symbol ^ in A and any word P in A. Thus, X) erases 
the last letter of any non-empty word in A. 

(c) Construct a normal algorithm (£ in B such that C(P) equals A if P 
contains exactly two occurrences of a and G(P) is defined and is not 
equal to A in all other cases. 

(d) Construct a normal algorithm in C such that, for any word P of A, 
ffl(P) = PP. 

5.63 Let A and B be alphabets and let a be a symbol in neither A nor B. For 
certain symbols ai , . . . , a* in A, let Qj , . . . , Q A . be corresponding words in B. 
Consider the algorithm that associates with each word P of A the word 
Sub/A 1 "'^ (P) obtained by simultaneous substitution of each Q, for a ,• 

Nil •"»>£* 

(7 = 1, . . . ,k). Show that this is given by a normal algorithm in AuBu{a}. 

5.64 Let H = {(} and M — Every natural number n is represented by 

its numeral 77, which is a word in H. We represent every A:- tuple 
(wi,« 2 , . . - ,«*) of natural numbers by the word 77iB 77 2 B . . . B 77* in M. We 
shall denote this word by • • • ,«*). For example, (3, 1,2) is ||||B||B|||. 

(a) Show that the schema 

B — ► B 

a || — >■ a| 
a| — ► • | 

A —*■ <x 

defines a normal algorithm 2lz over M such that 21^(77) = 0 for any n, and 
21 z is applicable only to numerals in M. 

(b) Show that the schema 

B — ► B 

a I “HI 

A —f a 

defines a normal algorithm 21# over M such that 21# (77) — n - f 1 for all n , 
and 21# is applicable only to numerals in M. 

(c) Let «i , . . - , a. 2 k be symbols not in M. Let 1 < k. Let \ be the list 

«2/-i B — ► a 2 /_ i B 

K2i— L | * ®2/ 1 

&2i\ —*■ <%2 i 


<*2i B — > CC2i+l 
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If 1 < j < k consider If j — 1 , consider If j = k, consider 
the algorithm schema the schema the schema 

oti B —> a,B y, 

: v.\ I — * a? I : 

Y j — | C<2 I — * |(X2 ^ k— 1 

0(2/- 1 B — * <* 2 j- lB «2 B — » OC3 ®2X'— 1 B — > 0 . 2 k- \ B 

«2y-l|— >a 2y| -5^2 (*2*~1 1 — > 0(2*| 

a 2/| — 1 ’ I a 2/ : «2*| — > |«2A- 

a 2/ B — > a2/+ 1 ^ *-i «2 a B — > < 22 * B 

•^/f l <*2*-l B —> (22*- lB «2A- — » • A 

: “2A— 1| — > 00*1 A — » a\ 

y *-l 0£»B| — ► «2A- 

0(2*- 1 B — > 0(2*- lB «2*B — > (22* B 

0(2*- 1 I a 2* 0(2* — > • A 

0t2A-| 0(2* A Of] 

0(2* B — ► C( 2 * B 
0(2* -*■ ■ A 
A — > «i 

Show that the corresponding normal algorithm 91* is such that 

/ — — — I J 

,«*)) = and 91^- is applicable to only words of the form 

(ti j j . • • j 77* ) ■ 

(d) Construct a schema for a normal algorithm in M transforming (n \ , n 2 ) 

into |*7 j — 772 1 . 

(e) Construct a normal algorithm in M for addition. 

(f) Construct a normal algorithm over M for multiplication. 


Given algorithms 51 and 93 and a word P, we write 91 (P) « 93 (P) if and 
only if either 91 and © are both applicable to P and 9I(P) = 93 (P) or neither 
91 nor © is applicable to P. More generally, if C and D are expressions, then 
C « D is to hold if and only if neither C nor D is defined, or both C and D 
are defined and denote the same object. If 91 and © are algorithms over an 
alphabet A, then we say that 91 and © are fully equivalent relative to A if 
and only if 5l(P) » ©(P) for every word P in A; we say that 91 and © are 
equivalent relative to A if and only if, for any word P in A, whenever 9I(P) or 
©(P) exists and is in A, then 9I(P) « ©(P). 

Let M be the alphabet {|,B}, as in Exercise 5.64, and let to be the set of 
natural numbers. Given a partial number-theoretic function tp of k argu- 
ments, that is, a function from a subset of co k into to, we deno te by © c „ the 
corresponding function in M; that is, © (/ ,( ( 771 , . . . ,n k ) ) = tp(rti , . . . , 77 *) 
whenever either of the two sides of the e quation is defi ned. ©^ is assumed to 
be inapplicable to words not of the form (hi, ... ,/?*). The function tp is said 
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to be Markov- computable if and only if there is a normal algorithm 51 over 
M that is fully equivalent to ©^ relative to MA 

A normal algorithm is said to be closed if and only if one of the pro- 
ductions in its schema has the form A — > ■ Q. Such an algorithm can end 
only terminally - that is, by an application of a terminal production. Given 
an arbitrary normal algorithm 51, add on at the end of the schema for 51 the 
new production A — » ■ A, and denote by 51- the normal algorithm deter- 
mined by this enlarged schema. 51- is closed, and 51- is fully equivalent to 51 
relative to the alphabet of 51. 

Let us now show that the composition of two normal algorithms is again 
a normal algorithm. Let 51 and © be normal algorithms in an alphabet A. 
For each symbol b in A, form a new symbol b, called the correlate of b. Let 
A be the alphabet consisting of the correlates of the symbols of A. We 
assume that A and A have no symbols in common. Let a and p be two 
symbols not in AuA. Let Ggi be the schema of 51- except that the terminal 
dot in terminal productions is replaced by a. Let S© be the schema of ©• 
except that every symbol is replaced by its correlate, every terminal dot is 
replaced by p, productions of the form A — > Q are replaced by a — ► aQ, and 
productions A — > • Q are replaced by a aPQ. Consider the abbreviated 
schema 

aa — * aa (a in A) 
aa — ► aa (a in A) 

a b — » a b (a, b in A) 
a p — > p a (a in A) 

P a — ► p a (a in A) 
ab->ab (a, b in A) 
ap — ► • A 
6 © 

<5* 

This schema determines a normal algorithm © over A such that 
©(P) « ©(5I(P)) for any word P in A. © is called the composition of 51 and 
© and is denoted © o 51. In general, by 51, , o . . . o 5li we mean 
51, , o (. . . o ( 5 I 3 o (5t 2 o2(|)).. .). 

Let 5) be an algorithm in an alphabet A and let B be an extension of A. If 
we take a schema for 5) and prefix to it the production b — > b for each 
symbol b in B — A, then the new schema determines a normal algorithm 5) B 
in B such that 5) B (P) ~ 5)(P) for every word P in A, and 5) B is not appli- 

tln this and in all other definitions in this chapter, the existential quantifier 
‘there is' is meant in the ordinary ‘classical’ sense. When we assert that there exists an 
object of a certain kind, we do not necessarily imply that any human being has found 
or ever will find such an object. Thus, a function (p may be Markov-computable 
without our ever knowing it to be so. 
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cable to any word in B that contains any symbol of B - A. 9) B is fully 
equivalent to 9) relative to A and is called the propagation of 9) onto B. 

Assume that 91 is a normal algorithm in an alphabet A\ and © is a 
normal algorithm in an alphabet A 2 . Let A = Aj uA 2 . Let 91 a and 93 A be 
the propagations of 91 and ffi, respectively, onto A. Then the composition 
© of 9 I a and © A is called the normal composition of 91 and © and is 
denoted by © o 91. (When Ai = A 2 , the normal composition of 91 and © is 
identical with the composition of 91 and ©; hence the notation © o 9f i s 
unambiguous.) © is a normal algorithm over A such that ©(P) ~ ©(9I(P)) 
for any word P in Aj , and © is applicable to only those words P of A 
such that P is a word of A ls 91 is applicable to P, and © is applicable to 
9J(P). 

PROPOSITION 5.24 

Let ZT be a Turing machine with alphabet A. Then there is a normal al- 
gorithm 91 over A that is fully equivalent to the Turing algorithm Alg^ 
relative to A. 

Proof 

Let D — (q Ao , . . . , q frm }, where q^, . . . , q^ are the internal states of ZT and 
q jto = q 0 . Write the algorithm schema for 91 as follows: First, for all qua- 
druples q.a,atq,. of take the production q y -a, — ► q,.a*. Second, for each 
quadruple q^a/Lq,. of , take the productions a„q y -a/ — > q r a H a,- for all sym- 
bols a« of A; then take the production q y a, — > q,.aoa,. Third, for each qua- 
druple q a/Rq,. of take the productions q -a f -a n —* a,q^a„ for all symbols a„ 
of A; then take the production q^-a, — > a,q,.ao. Fourth, write the productions 
q^. — » • A for each internal state q* f of and finally take A — » q 0 . This 
schema defines a normal algorithm 91 over A, and it is easy to see that, for 
any word P of A, Alg ^(P) ps 91 (P). 

COROLLARY 5.25 

Every Turing-computable function is Markov-computable. 


Proof 


Let /( jc i be standard Turing-computable by a Turing machine IT 
with alphabet A 3 {|, B}. (Remember that B is ao and | is ai.) We know 
that, for any natural num bers k\,. . . ,k „ , if f {k \ , . . . ,k„) is not defined, then 
Algjr is not applicable to (/q , . . . , k„) , whereas, if f(k\ is defined, 
then 
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Alg.^((fci,... t /t n )) ~ Ri B f{k\,. . . ,£„)R 2 

where Ri and R 2 are (possibly empty) sequences of Bs. Let © be a normal 
algorithm over A that is fully equivalent to Alg r relative to A. Let (6 be the 
normal algorithm over { | , B} determined by the schema 

aB —* a 

a| -» P| 

PI-IP 

PB * By 

vl — PI 

yB-^y 
By — » * A 
P — ► • A 

A — » a 

If Rj and R 2 are possibly empt y sequences of Bs, then G, when applied to 
Ri B f{k \ , . . . , k„) R 2 , will erase Ri and R 2 . Finally, let 2I"|) 

be the normal ‘projection’ algorithm defined in Exercise 5.64(c). Then the 
normal composition 2I"+j o © o S is a normal algorithm that computes /. 

Let 21 be any algorithm over an alphabet A = {a yo , . . . ,ay m }. We can 
associate with 21 a partial number-theoretic function such that 
ij/% j(«) = m if and only if either n is not the Godel number* of a word of A 
and m = 0, or n and m are Godel numbers of words P and Q of A such that 
2I(P) = Q. 

PROPOSITION 5.26 

If 21 is a normal algorithm over A = {a /0 , . . . , a Jm }, then is partial re- 
cursive. 

Proof 

We may assume that the symbols of the alphabet of 21 are of the form a, . 
Given a simple production P Q, we call 2 1 3 0 * P *5 S ^* its index; given a 
terminal production P — » • Q, we let 2 2 3 ff ( p * 5 9 ^ be its index. If 
Po-(0Qo,--->Pr-(-)Q r is an algorithm schema, we let its index be 
2* D 3* i . . . p* r , where ki is the index of P f — > (-)Q f . Let Word(w) be the recur- 
sive predicate that holds if and only if u is the Godel number of a finite 
sequence of symbols of the form a 

•Here and below, we use the Godel numbering of the language of Turing 
computability given in Section 5.3 (p. 321). Thus, the Godel number g(a t ) of a, is 
7 + 4i. In particular, g(B) = g(a 0 ) = 7 and gfl) = </(ai) = 11. 
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u ^ 0 A [u — 1 V (Vz)(z < £fi{ii) => (3y)(y < u A (u) z = 7 + 4_y) ) ] 

Let SI(m) be the recursive predicate that holds when u is the index of a simple 
production: £/i{u) — 3 A («) 0 = 1 A Word((t/)j)A Word((w) 2 ). Similarly, 
TI(w) is the recursive predicate that holds when u is the index of a terminal 
production: £fi{u) = 3 A («) 0 = 2 A Word((t/)j) A Word((«) 2 ). Let Ind(w) 
be the recursive predicate that holds when u is the index of an algorithm 
schema: u > 1 A (Vz)(z < / A(u) => SI((t/) z ) VTI((w) z )). Let Lsub(x,y, e) be 
the recursive predicate that holds if and only if e is the index of a production 
P — > ()Q andx and y are Godel numbers of words U and V such that P occurs 
in U, and V is the result of substituting Q for the leftmost occurrence of P in U: 

Word(x) A Word(y) A (SI(e) V TI(e)) A (3u) w ^ T (3y) t , ^ X (x = u* (e), * v 
A y = u* (e) 2 * v A -i(3w) w < x .(3z) z ^ X (x = w * (e), * z Aw < u )) 

Let Occ(x, y) be the recursive predicate that holds when x and y are Godel 
numbers of words U and V such that V occurs in U: Word(x) A Word(y) 
A(3r) t , ^ v (3z) z ^ X (x = v * y * z). Let End(e, z) be the recursive predicate that 
holds when and only when z is the Godel number of a word P, and e is the 
index of an algorithm schema defining an algorithm 21 that cannot be ap- 
plied to P (i.e., 21 : Pn):lnd(e) A Word(z) A (Vw) u , < ^ (e) -.Occ(z, ((<?)„,),). 
Let SCons(e,j^,x) be the recursive predicate that holds if and only if e is the 
index of an algorithm schema and y and x are Godel numbers of words V 
and U such that V arises from U by a simple production of the schema: 

Ind(e) A YVord(x) A Word(y) A (3t>)„ < ^ (e) [SI((e)J A Lsub(x,y, (e)„) 

A (Vz) z<t ,-iOcc(x, ((c) z ), ) ] 

Similarly, one defines the recursive predicate TCons(e,j,x), which differs 
from SCons(e,;y,x) only in that the production in question is terminal. Let 
Der(e, x,y) be the recursive predicate that is true when and only when e is the 
index of an algorithm schema that determines an algorithm 21, x is the 
Godel number of a word Uq, y is the Godel number of a sequence of words 
LJ 0 , . . . , > 0) such that, for 0 < k — l,U,- + i arises from U,- by a 

production of the schema, and either 21 : h • or 21 : I- U* and 

21 : Ufr Z)(or,if k = 0, just 21 : U*ii): 

Ind(e) A Word(x) A (Vz) z< ^ 0() Word((y) 2 ) A (y) 0 = x 

A ( Vz )z<«(j-)-2 SCons ( e ’ (>')z+o (^) z ) A IVAy) = 1 A End(e, (y) 0 )) 

V (SA(y) > 1 A (TCons(e, (y)^ (y) ^ t , (y) u(y) ^ 2 ) v (SCons (e, (y)^ (y) ^ t , 

(y)f/i{y)^ 2 ) A End ( e > (t)^(j)-i))})] 

Let IVa(u) be the recursive predicate that holds if and only if u is the 
Godel number of a word of A: 

u ± 0 A (u = I V (Vz) z< ^ H) (( w ) z = 7 + 4 Jo V ■ ■ • V (u) z = 7 + 4 jm ) 


Y « 
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Let e be the index of an algorithm schema for 91. Now define the partial 
recursive function <p(x) — A Der(e,A,;y)) V 1 Wa(x)). But 

tf/ m (a) = ((p(x))fj — 1 • Therefore, i// yi is partial recursive. 

COROLLARY 5.27 

Every Markov-computable function <p is partial recursive. 

Proof 

Let 91 be a normal al gorithm over {1, B} such that <p(/q, . . . ,k„) = / if and 
only if 9l((&i, . . . ,k„)) — 7. By Proposition 5.26, the function i is partial 
recursive. Define the recursive function }-'(a) = £/i{x) — 1. If a = II" =0 p \ 1 , 
then n = y(x). (Remember that a stroke |, which is an abbreviation for ai, 
has Godel number 1 1 . So, if a is the Godel number of the numeral n, then 
}>(a) = n.) Let ^{k \ , . . . , k„) be the Godel number of (k \ , . . . , k„): 

Z(ki, .-.,k„) =g((ki,. . = ^(| A|+I B| A ' 2+1 B . . . B|* ,,+1 ) 

*+l *2 + 1 

= 01 *”) • (Av+ 2) 7 ’ (J^CPZ+q+3) 11 ) ‘ {Ph+k'2+5) 7 ■ • ' 

i =0 /=0 

V+l 

• {Ph +—+* k +2h — 3 ) ? • ( H iPi+h +■••+*, +2» ^ 2 ) 1 ' ) 

i=0 

£ is clearly recursive. Then (p = y o ip^ o £ is partial recursive. 

The equivalence of Markov computability and Turing computability 
follows from Corollaries 5.25 and 5.27 and the known equivalence of Turing 
computability and partial recursiveness. Many other definitions of com- 
putability have been given, all of them turning out to be equivalent to 
Turing computability. One of the earliest definitions, A-computability, was 
developed by Church and Kleene as part of the theory of A-conversion (see 
Church, 1941). Its equivalence with the intuitive notion of computability is 
not immediately plausible and gained credence only when A-computability 
was shown to be equivalent to partial recursiveness and Turing comput- 
ability (see Kleene, 1936b; Turing, 1937). All reasonable variations of Tu- 
ring computability seem to yield equivalent notions (see W. Oberschelp, 
1958; Fischer, 1965). 

5.6 DECISION PROBLEMS 

A class of problems is said to be unsolvable if there is no effective procedure 
for solving each problem in the class. For example, given any polynomial 
/(a) with integral coefficients (for example, 3 a 5 — 4a 4 + 7a 2 — 13a + 12), is 
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there an integer k such that f(k) = 0 ? We can certainly answer this question 
for various special polynomials, but is there a single general procedure that 
will solve the problem for every polynomial f{x)l (The answer is given 
below in paragraph 4.) 

If we can arithmetize the formulation of a class of problems and assign to 
each problem a natural number, then this class is unsolvable if and only if 
there is no computable function h such that, if n is the number of a given 
problem, then h(n) yields the solution of the problem. If Church’s thesis is 
assumed, the function h has to be partial recursive, and we then have a more 
accessible mathematical question. 

Davis (1977b) gives an excellent survey of research on unsolvable prob- 
lems. Let us look at a few decision problems, some of which we already have 
solved. 

1. Is a statement form of the propositional calculus a tautology? Truth 
tables provide an easy, effective procedure for answering any such question. 

2. Decidable and undecidable theories. Is there a procedure for deter- 
mining whether an arbitrary wf of a formal system is a theorem of 5^? If 
so, & is called decidable’, otherwise, it is undecidable. 

(a) The system L of Chapter 1 is decidable. The theorems of L are the 
tautologies, and we can apply the truth table method. 

(b) The pure predicate calculus PP and the full predicate calculus PF were 
both shown to be recursively undecidable in Proposition 3.54. 

(c) The theory RR and all its consistent extensions (including Peano 
arithmetic S) have been shown to be recursively undecidable in Cor- 
ollary 3.46. 

(d) The axiomatic set theory NBG and all its consistent extensions arc 
recursively undecidable (see pages 269—70). 

(e) Various theories concerning order structures or algebraic structures 
have been shown to be decidable (often by the method of quantifier 
elimination). Examples are the theory of unbounded densely ordered 
sets (see page 116 and Langford, 1927), the theory of abelian groups 
(Szmielew, 1955), and the theory of real closed fields (Tarski, 1951). For 
further information, consult Kreisel and Krivine (1967, Chap. 4); 
Chang and Keisler (1973, Chap. 1.5); Monk (1976, Chap. 13); Ershov 
et al. (1965); Rabin (1977); and Baudisch et al. (1985). On the other 
hand, the undecidability of many algebraic theories can be derived from 
the results in Chapter 3 (see Tarski, Mostowski and Robinson, 1953, 
II.6, III; Monk, 1976, Chap. 16). 

3. Logical validity. Is a given wf of quantification theory logically valid? 
By Godel’s completeness theorem (Corollary 2.19), a wf is logically valid if 
and only if it is provable in the full predicate calculus PF. Since PF is 
recursively undecidable (Proposition 3.54), the problem of logical validity is 
recursively unsolvable. 
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However, there is a decision procedure for the logical validity of wfs of 
the pure monadic predicate calculus (Exercise 3.59). 

There have been extensive investigations of decision procedures for 
various important subclasses of wfs of the pure predicate calculus; for ex- 
ample, the class (V 3 V) of all closed wfs of the form (Vx ) ( 3y ) ( Vz) ST (x , y, z ) , 
where t%(x,y,z) contains no quantifiers. See Ackermann (1954), Dreben and 
Goldfarb (1980) and Lewis (1979). 

4. Hilbert’s Tenth Problem. If f(x j, ... ,.\'„) is a polynomial with integral 
coefficients, are there integers k\, ... , k„ such that f{k\ , ... , k„) = 0? This 
difficult decision problem is known as Hilbert’s tenth problem. 

For one variable, the solution is easy. When oq, a\, ... , a„ are integers, 
any integer x such that + ... + a\x + — 0 must be a divisor of oq. 

Hence, when c/q ^ 0, we can test each of the finite number of divisors of oq. 
If oq — 0, then x = 0 is a solution. However, there is no analogous proce- 
dure when the polynomial has more than one variable. It was finally shown 
by Matiyasevich (1970) that there is no decision procedure for determining 
whether a polynomial with integral coefficients has a solution consisting of 
integers. His proof was based in part on some earlier work of Davis, Put- 
nam and Robinson (1961). The proof ultimately relies on basic facts of 
recursion theory, particularly the existence of a non-recursive r.e. set 
(Proposition 5.21(e)). An up-to-date exposition may be found in Mat- 
iyasevich (1993). 

5. Word problems. 

(a) Semi-Time Systems. Let B = (bj, . . . ,b„} be a finite alphabet. 
Remember that a word of B is a finite sequence of elements of B. 
Moreover, the empty sequence A is considered a word of B. By a pro- 
duction of B we mean an ordered pair (u,v), where u and v are words of 
B. If p = (u,v) is a production of B, and if w and W are words of B, we 
write w => p W if w' arises from w by replacing a part u of w by v. (Recall 
that u is a part of w if there exist (possibly empty) words wi and W2 such 
that w = W) UW2-) 

By a semi-Thue system on B we mean a finite set ST of productions of B. 
For words w and W of B, we write w =$■<? vv / if there is a finite sequence Wq, 
wi, ... , W/f- (k > 0) of words of B such that w = wo, w' = w*, and, for 
0 < k, there is a production p of ST such that w > => p w,+i. Observe that 

w vv for any word w of B. Moreover, if W] =>#> W2 and W2 W3, then 
W[ =$>& W3. In addition, if wj =$■& W2 and W3 W4, then W1W3 W2W4. 
Notice that there is no fixed order in which the productions have to be 
applied and that many different productions of Sf might be applicable to the 
same word. 

By a Time system we mean a semi-Thue system such that, for every 
production (u,v), the inverse (v,u) is also a production. Clearly, if ST is a 
Thue system and w W, then W w. Hence, =>#• is an equivalence 
relation on the set of words of the alphabet of ST . 
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Example 

Let be the Thue system that has alphabet {b} and productions (b 3 , A) 
and (A, b 3 ). It is easy to see that every word is transformable into b 2 , b, or 
A. 

By a semigroup we mean a non-empty set G together with a binary op- 
eration on G (denoted by the juxtaposition uv of elements u and v) that 
satisfies the associative law x{yz) — ( xy)z . An element y such that 
xy = yx = x for all x in G is called an identity element. If an identity element 
exists, it is unique and is denoted 1. 

A Thue system on an alphabet B determines a semigroup G with an 
identity element. In fact, for each word w of B, let [w] be the set of all words 
W such that w w'. [w] is just the equivalence class of w with respect to 
Let G consist of the sets [w] for all words w of B. If U and V are 
elements of G, choose a word u in U and a word v in V. Let UV stand for 
the set [uv]. This defines an operation on G, since, if u' is any word in U and 
v' is any word in V, [uv] = [uV]. 

Exercises 

5.65 For the set G determined by the Thue system prove: 

(a) (UV)W = U(VW) for all members U, V and W of G. 

(b) The equivalence class [A] of the empty word A acts as an identity 
element of G. 

5.66 

(a) Show that a semigroup contains at most one identity element. 

(b) Give an example of a semigroup without an identity element. 

A Thue system S? provides what is called a finite presentation of the 
corresponding semigroup G. The elements b[, ... , b,„ of the alphabet of 
are called generators, and the productions (u,v) of are written in the form 
of equations u = v. These equations are called the relations of the presen- 
tation. Thus, in the example above, b is the only generator and b 3 = A can 
be taken as the only relation. The corresponding semigroup is a cyclic group 
of order 3. 

If is a semi-Thue or Thue system, the word problem for is the 
problem of determining, for any words w and w / , whether w w'. 

Exercises 

5.67 Show that, for the Thue system in the example, the word problem 
is solvable. 

5.68 Consider the following Thue system Sd. The alphabet is |a,b,c,d} and 
the productions are (ac, A), (ca, A), (bd, A), (db, A), (a 3 , A), (b 2 , A), 
(ab, ba), and their inverses. 
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(a) Show that c a 2 and d b. 

(b) Show that every word of F can be transformed into one of the words a, 
a 2 , b, ab, a 2 b, and A. 

(c) Show that the word problem for F is solvable. [Hint: To show that the 
six words of part (b) cannot be transformed into one another, use the 
cyclic group of order 6 generated by an element g , with a = g 2 and 
b = g 3 . ] 

PROPOSITION 5.30 

(Post, 1947) There exists a Thue system with a recursively unsolvable word 

problem. 


Proof 

Let F be a Turing machine with alphabet {ao,aj, ... , a,,} and internal 
states (q 0? q, , ... , q,„ } . Remember that a tape description is a sequence of 
symbols describing the condition of F at any given moment; it consists of 
symbols of the alphabet of F plus one internal state q,, and q j is not the last 
symbol of the description. F is in state q j, scanning the symbol following q 7 , 
and the alphabet symbols, read from left to right, constitute the entire tape 
at the given moment. We shall construct a semi-Thue system F that will 
reflect the operation of F : each action induced by quadruples of F will be 
copied by productions of F . The alphabet of F consists of 
(ao,ai, ... , SL fh <lo,c[\, The symbol p will be placed at the 

beginning and end of a tape description in order to ‘alert’ the semi-Thue 
system when it is necessary to add an extra blank square on the left or right 

end of the tape. We wish to ensure that, if W i— > W', then pWp PW'P. 

.T 

The productions of F are constructed from the quadruples of F in the 
following manner. 

(a) If q a/a*q r is a quadruple of F, let (q-a*, a r q A .) be a production of F. 

(b) If q^a/Rq,. is a quadruple of F, let (q,a ,a^,fir,q ; .a^) be a production of F 
for every a£. In addition, let (q a,-pa/, q^aoP) be a production of F. (This 
last production adds a blank square when F reaches the right end of 
the tape and is ordered to move right.) 

(c) If q-a,-Lq r is a quadruple of F, let (a^q -a,-, q,.a£a,) be a production of F 
for each a^. In addition, let (Pq^a,-, pq^oa,-) be a production of F. (This 
last production adds a blank square to the left of the tape when this is 
required.) 

(d) If there is no quadruple of F beginning with q y a,, let F contain the 
following productions: (q^a/, S), (dae, d) for all a^; {3 p, £,), (a^, £,) for 
all a£; and (p^,Q. 
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stops when it is in a state q y , scanning a symbol a,-, such that q y a,- does not 
begin a quadruple of 3T. In such a case, would replace q y a r in the final 
tape description of by b. Then b proceeds to annihilate all the other 
symbols to its right, including the rightmost p, whereupon it changes to £ 
then annihilates all symbols to its left, including the remaining p. The final 
result is E, alone. Hence: 

(□) For any initial tape description a, 9 ~ halts when and only when pap £ 

Now, enlarge to a Thue system Sf* by adding to SF the inverses of all 
the productions of Of. Let us show that: 

(V) For any initial tape description a of Pap £if and only if Pap £ 

Clearly, if pap =>&> £, then PaP £,. Conversely, assume for the sake of 
contradiction that paP but it is not the case that pap =$■#> %. Consider 

a sequence of words leading from pap to in SF': 


pap = Wo wi w*_i =$■</ w t = E, 

Here, each arrow is intended to indicate a single application of a production. 
It is clear from the definition of that no production of Sf applies to E, 
alone. Hence, the last step in the sequence w r _ i =>y E, must be the result of a 
production of SF. So, w,_i =>y Working backward, let us find the least p 
such that w p =yy £,. Since we have assumed that it is not true that PaP =>#> 
we must have p > 0. By the minimality of p, it is not true that w p -i =$■&> v/ p . 
Therefore, w p w p _ 1 . Examination of the productions of SF shows that 

each of the words wo, Wj, . . . , w r must contain exactly one of the symbols 
q 0 , qi, ... , q^, <5, or and that, to such a word, at most one production of 

is applicable. But, w p is transformed into both w p+ i and Wp_j by pro- 
ductions of Sf. Hence, w p _i = w p +i. But, w^+i £,. Hence, w p _i 
contradicting the definition of p. This establishes (V). 

Now, let be a Turing machine. with a recursively unsolvable halting 
problem (Proposition 5.14). Construct the corresponding Thue system 9*' as 
above. Then, by (□) and (V), for any tape description a, halts if and only 
if PaP y So, if the word problem for 9' were recursively solvable, the 
halting problem for 9 would be recursively solvable. (The function that 
assigns to the Godel number of a the Godel number of (PaP, £,) is clearly 
recursive under a suitable arithmetization of the symbolism of T uring ma- 
chines and Thue systems.) Thus, 9’ has a recursively unsolvable word 
problem. 

That the word problem is unsolvable even for certain Thue systems on a 
two-element alphabet (semigroups with two generators) was proved by Hall 
(1949). 

(b) Finitely presented groups. A. finite presentation of a group consists of a 
finite set of generators g l5 ... ,g r and a finite set of equations Wi = 
W', , ... , W, = W' between words of the alphabet B = {gj , ... , 


DECISION PROBLEMS 


367 


g n g^ 1 , .. . , g- 1 } . What is really involved here is a Thue system ff with 
alphabet B, productions (W|,Wj), . . . , (W,, W') and their inverses, and all 
the productions (g ? gr l , A), (g/'g,-, A) and their inverses. The corresponding 
semigroup G is actually a group and is called a finitely presented group . The 
word problem for G (or, rather, for the finite presentation of G) is the word 
problem for the Thue system 

Problems that concern word problems for finitely presented groups are 
generally much more difficult than corresponding problems for finitely 
presented semigroups (Thue systems). The existence of a finitely presented 
group with a recursively unsolvable word problem was proved, indepen- 
dently, by Novikov (1955) and Boone (1959). Other proofs have been given 
by Higman (1961), Britton (1963), and McKenzie and Thompson (1973). 
(See also Rotman, 1973.) Results on other decision problems connected with 
groups may be found in Rabin (1958). For corresponding problems in 
general algebraic systems, consult Evans (1951). 





Our treatment of quantification theory in Chapter 2 was confined to first- 
order logic, that is, the variables used in quantifiers were only individual 
variables. The axiom systems for formal number theory in Chapter 3 and set 
theory in Chapter 4 also were formulated within first-order languages. This 
restriction brings with it certain advantages and disadvantages, and we wish 
now to see what happens when the restriction is lifted. That will mean 
allowing quantification with respect to predicate and function variables. 
Emphasis will be on second-order logic, since the important differences 
between first-order and higher-order logics already reveal themselves at the 
second-order level. Our treatment will offer only a sketch of the basic ideas 
and results of second-order logic. 

Let L1C be the first-order language in which C is the set of non-logical 
constants (that is, individual constants, function letters, and predicate let- 
ters). Start with the language L1C, and add function variables g" and pre- 
dicate variables R”, where n and i are any positive integers. t (We shall use 
g", h", . . . to stand for any function variables of n arguments, and 
R”, S", . . . , X", Y", Z H to stand for any predicate variables of n arguments. 
We shall also omit the superscript n when the value of n is clear from the 
context.) Let (u) n stand for any sequence of individual variables ii \ , . . . , uj- 
and let V{«)„ stand for the expression (Vt/i) . . . (Vw„). Similarly, let (t) n stand 
for a sequence of terms t\ , . . . , We expand the set of terms by allowing 
formation of terms g"({/)„), where g" is a function variable, and we then 
expand the set of formulas by allowing formation of atomic formulas 


tWe use bold letters to avoid confusion with function letters and predicate 
letters. Note that function letters and predicate letters are supposed to denote specific 
operations and relations, whereas function variables and predicate variables vary 
over arbitrary operations and relations. 

Tn particular, {x) n will stand for *i, . . . ,x„. 
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and R"((f) n ) where {/)„ is any sequence of the newly enlarged set of 
terms, A ” is any predicate letter of C and R” is any n-ary predicate variable. 
Finally, we expand the set of formulas by quantification ( Vg " ) PA and 
(VR' ! )^ with respect to function and predicate variables. 

Let L2 C denote the second-order language obtained in this way. The 
language L2C will be called a full second-order language. The adjective ‘full’ 
indicates that we allow both function variables and predicate variables and 
that there is no restriction on the arity n of those variables. An example of a 
non-full second-order language is the second-order monadic predicate lan- 
guage in which there are no function letters or variables, no predicate letters, 
and only monadic predicate variables.! 

It is not necessary to take = as a primitive symbol, since it can be defined 
in the following manner. 

DEFINITIONS 

t — u stands for (VR 1 )(R 1 t R'n) 
g " = h" stands for V{x}„(g"({x), ( ) = h" ((*)„)) 

R" = S" stands for V(*)„(R"«x)„) ^ S"«x)„)) 

STANDARD SECOND-ORDER SEMANTICS FOR L2C 

For a given language L2C, let us start with a first-order interpretation with 
domain D. In the first-order case, we defined satisfaction for the set of 
denumerable sequences of members of D. Now, instead of we use the set 
V of functions s that assign to each individual variable a member of Z), to 
each function variable g" some n-ary operation ^(g”) on D, and to each 
predicate variable R" some n-ary relation* x(R") on D. For each such s, we 
extend the denotations determined by 5 by specifying that, for any terms 
fi, . . . , t„ and any function variable g", the denotation s(g"(/i, . . . , t „ )) is 
s(^ , )(s(t\) J ... , s(t„)). The first-order definition of satisfaction is extended as 
follows: 

(a) For any predicate variable R" and any finite sequence (t) n of terms, s 
satisfies R"((l) ;f ) if and only if (s(ti), . . . , s(t n )) € ^(R M ); 

! Third-order logics are obtained by adding function and predicate letters and 
variables that can have as arguments individual variables, function and predicate 
letters, and second-order function and predicate variables, and then allowing 
quantification with respect to the new function and predicate variables. This pro- 
cedure can be iterated to obtain nth-order logics for all n > 1 . 

* An n-ary relation on D is a subset of the set D ” of n-tuples of D. When n — 1, an 
n-ary relation is just a subset of D. 
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(b) 5 satisfies (Vg")^? if and only if s' satisfies 83 for every s' in Y that 

agrees with 5 except possibly at g"; 2 

(c) s satisfies ( V R" ) 83 if and only if s' satisfies 83 for every s' in Y that 

agrees with s except possibly at R". 2 

The resulting interpretation Ji is called a standard interpretation of the 
given language. 

A formula 83 is said to be true for a standard interpretation Ji (written 
Ji |= 83) if 83 is satisfied by every .v in Y ■ 83 is false for Ji if no sequence s in 
Y satisfies 83. 

'A formula 83 is said to be standardly valid if 83 is true for all standard 
interpretations. 83 is said to be standardly satisfiable if 83 is satisfied by some 
s in Y i n some standard interpretation. A formula ^ is said to be a stan- 
dard logical consequence of a set T of formulas if, for every standard in- 
terpretation, every s in Y that satisfies every formula in T also satisfies ( 8. 
A formula 83 is said to standardly logically imply a formula 8? if ^ is a logical 
consequence of { 83 }. 

The basic properties of satisfaction, truth, logical consequence, and lo- 
gical implication that held in the first-order case (see ( I)-(XI) on pp. 61—3) 
also hold here for their standard versions. In particular, a sentence 83 is 
standardly satisfiable if and only if 83 is true for some standard interpreta- 
tion. 

We shall see that second-order languages have much greater expressive 
power than first-order languages. This is true even in the case where the set 
C of non-logical constants is empty. The corresponding language L20 will be 
denoted L2 and called the pure full second-order language. Consider the 
following sentence in L2. 

(1) (3g)(3x)(VR)[(R(x) A (Vy)(R(y) =* R(g O')))) =* (Vx)R(x)] 

This sentence is true for a standard interpretation if and only if the domain 
D is finite or denumerable. To see this, consider an operation y and element 
x given by this sentence. By induction, define the sequence x, y(x), 
g(y(x)), g(g(g(x ))) , . . . , and let R be the set of objects in this sequence. R is 
finite or denumerable, and (1) tells us that every object in D is in R. Hence, 
D — R and D is finite or denumerable. Conversely, assume that D is finite or 
denumerable. Let F be a one-one function from D onto co (when D is 
denumerable) or onto an initial segment {0, 1, . .., n} of co (when D is 
finite).! Let je^F^O) and define an operation g on D in the following 
manner. When D is denumerable, g(u) — F~ l (F(u) + 1) for all it in D; when 
D is finite, let g(u) — F~ l (F(u) + 1)) if F{u) < n and g(u ) — x if F{u) — n. 
With this choice of g and x, (1) holds. 


t Remember that the domain of an interpretation is assumed to be non-empty. 
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Exercise 

A.l Show that there is no first-order sentence $ such that (ft is true in an 
interpretation if and only if its domain is finite or denumerable. [Hint: 
Use Corollary 2.22.] 

Let us introduce the abbreviations Y 1 CX 1 for (Vm)(Y 1 (i/) 
NonEm(X* ) for (3w)(X 1 (w)), and Asym(R 2 , X 1 ) for (Vm)(Vu)(X 1 (w) AX’(d) 
A R 2 (u, v) => ->R 2 (r, it)). Let R 2 We X 1 stand for the second-order formula 

Asym(R 2 , X 1 ) A (VY 1 )^ 1 C X 1 A NonEmtY 1 ) 

=>■ (3w)(Y , (t/) A (Vi^Y 1 ^) A v ^ u => R 2 («, r)))) 

Then R 2 We X 1 is satisfied by an assignment in a given standard inter- 
pretation if and only if the binary relation assigned to R 2 well-orders the set 
assigned to X 1 . 

Let Suc(«, v, R 2 ) stand for R 2 (r, a) A (Vw)-'(R 2 (t>, w) A R 2 (w, «)), and 
let First (w, R 2 ) stand for (Vv)(v ^ a R 2 (w, r)). Consider the following 
second-order formula. 

(2) (3R 2 )(3X , )(R 2 We X 1 A (V«)X l («) A (V«)(-iFirst(i/, R 2 ) 

=> (3u)Suc(tt, v, R 2 )) A (3«)(Vi>)(t> 7 ^ 11 =>• R 2 (t>, «))) 

This is true for a standard interpretation if and only if there is a well- 
ordering of the domain in which every element other than the first is a 
successor and there is a last element. But this is equivalent to the domain 
being finite. Hence, (2) is true for a standard interpretation if and only if its 
domain is finite. 

Exercise 

A .2 (a) Show that, for every natural number n, there is a first-order sentence 
the models of which are all interpretations whose domain contains at 
least n elements, (b) Show that, for every positive integer /?, there is a 
first-order theory the models of which are all interpretations whose 
domain contains exactly n elements, (c) Show that there is no first-order 
sentence (ft that is true for any interpretation if and only if its domain is 
finite. 

The second-order sentence (1) A -«(2) is true for a standard interpretation 
if and only if the domain is denumerable. 


Exercises 

A.3 Show that there is no first-order sentence (% the models of which are all 
interpretations whose domain is denumerable. 
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A .4 Construct a second-order formula Den(X ] ) that is satisfied by an as- 
signment in a standard interpretation if and only if the set assigned to 
X 1 is denumerable. 


SECOND-ORDER THEORIES 


We define a second-order theory in a language L2C by adding the following 
new logical axioms and rules to the fust-order axioms and rules. 

(B4a) (VR M )^(R") =^(W W ), where <%(W) arises from #(R") by 
replacing all free occurrences of R” by W" and W" is free for R" in 

(B4b) (V g u )3d(g n ) => 3d(h n ), where 3?(h") arises from 3d(g n ) by replacing all 
free occurrences of g'' by h" and h" is free for g" in 3d(g"). 

(B5a) (VR")(3? => => {3d (V R")^), where R" is not free in 3d. 

(B5b) {'3g"){38 => *&) => {3d => (Vg")^), where g” is not free in 3d. 

COMPREHENSION SCHEMA (COMP) 

(3R")(V(x), ; )(R"((x) /I ) 44 3d), provided that all free variables of 3d occur in 
(x) n and R” is not free in 3d. 


FUNCTION DEFINITION SCHEMA ( FUNDEF ) 


(VR" +1 )[(VW„)(3 0 .)R” +, (W„,J') =S- (38")(VW„)R” +, ((a>,„ g”(W„))l 


W + l 


w+l 


NEW RULES 

(Gen2a) (y\L n )3d follows from 3d 
(Gen2b) (Vg n )3d follows from 3d 

Exercises 

A.5 Show that we can prove analogues of the usual equality axioms (A6)- 
(A7) in any second-order theory: 

(i) |- t = t A g" = g" A R" = R" 

(ii) I- t = s =4* ( 3d{t , t ) =?► 3d{t , s)), where 3d(t , 5 ) arises from 3d(t , t ) by re- 
placing zero or more occurrences of t by 5 , provided that 5 is free for t in 
3d{t, t). 
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(iii) b g" = h H => (^(g", g”) =>- ^(g", h M )), where ^?(g", h”) arises from 
^(g", g") by replacing zero or more occurrences of g" by h", provided 
that h" is free for g" in ,^(g", g”). 

(iv) HR" = S" => (^(R", R") ^(R", S")), where £%(R", S") arises from 

0§(R n , R") by replacing zero or more occurrences of R” by S w , provided 
that S" is free for R" in <0(R", R"). 

A.6 Formulate and prove a second-order analogue of the first-order de- 
duction theorem (Proposition 2.5). 

Let PC2 denote the second-order theory in the language L2C without any 
non-logical axioms. PC2 is called a second-order predicate calculus. 

PROPOSITION A.l (SOUNDNESS) 

Every theorem of PC2 is standardly valid. 


Proof 

That all the logical axioms (except Comp and FunDef) are standardly valid 
and that the rules of inference preserve standard validity follow by argu- 
ments like those for the analogous first-order properties. The standard va- 
lidity of Comp and FunDef follows by simple set-theoretic arguments. 

We shall see that the converse of Proposition A.l does not hold. This will 
turn out to be not a consequence of a poor choice of axioms and rules, but 
an inherent incompleteness of second-order logic. 

Let us consider the system of natural numbers. No first-order theory will 
have as its models those and only those interpretations that are isomorphic 
to the system of natural numbers. f However, a second-order characteriza- 
tion of the natural numbers is possible. Let AR2 be the conjunction of the 
axioms (S1)-(S8) of the theory S of formal arithmetic (see p. 155), and the 
following second-order principle of mathematical induction: 

(2S9) (\/R l )[R\0) A (Vx)(R l (x) =► R l (x')) =* (Vx)R' (a-)] 

Notice that, with the help of (Comp), all instances of the first-order axiom 
schema (S9) can be derived from (2S9)b 

tLet K be any first-order theory in the language of arithmetic whose axioms are 
true in the system of natural numbers. Add a new individual constant b and the 
axioms b ^ n for every natural number «. The new theory K* is consistent, since any 
finite set of its axioms has a model in the system of natural numbers. By Proposition 
2.17, K* has a model, but that model cannot be isomorphic to the system of natural 
numbers, since the object denoted by b cannot correspond to a natural number. 

*In AR2, the function letters for addition and multiplication and the associated 
axioms (S5)-(S8) can be omitted. The existence of operations satisfying (S5)-(S8) can 
then be proved. See Mendelson (1973, Sections 2.3 and 2.5). 
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For any standard interpretation that is a model of AR2 we can prove the 
following result that justifies inductive definition. 


PROPOSITION A.2 (ITERATION THEOREM) 

Let Jl be a standard interpretation that is a model of AR2, and let D be the 
domain of Jl. Let c be an element of an arbitrary set W and let g be a 
singulary operation of W. Then there is a unique function F from D into W 
such that 27(0) = c and (Vx)(x € D =>■ F(x') = g(F(x)))J 


Proof 

Let ^ be the set of all subsets H of D x W such that ( 1 , c) e H and 
(Vx) (Vw) ( (x, w) € H =>• (x 1 , £/(w)) E H). Note that D x W E < €. Let F be the 
intersection of all sets H in We leave it to the reader to prove the fol- 
lowing assertions: 

(a) F e<£ 

(b) F is a function from D into W. [Hint: Let B be the set of all x in D for 
which there is a unique w in W such that (x, w) 6 F. By mathematical 
induction, show that B — D.] 

(c) F(l) = c. 

(d) F(x') — g(F(x)) for all x in D. 

The uniqueness of F can be shown by a simple application of mathematical 
induction. 


PROPOSITION A.3 (CATEGORICITY OF AR2) 

Any two standard interpretations Ji and JC that are models of AR2 are 
isomorphic. 


Proof 

Let D and D* be the domains of Jl and Jl*, 0 and 0* the respective zero 
elements, and f and /* the respective successor operations. By the iteration 
theorem applied to Jl, with W = D *, c — 0* and g=f*, we obtain a 
function F from D into D* such that A(0) = 0* and F( f(x)) = f*(F(x)) for 
any x in D. An easy application of mathematical induction in Jl* shows that 
every element of D* is in the range of F. To show that F is one-one, apply 

tin order to avoid cumbersome notation, ‘0’ denotes the interpretation in Jl of 
the individual constant ‘O’, and V’ denotes the result of the application to the object 
jc of the interpretation of the successor function. 
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mathematical induction in J/ to the set of all * in D such that 
060 [()> € D A y ^ a:) => F(x ) ^ 

Let s3 consist of the non-logical constants of formal arithmetic (zero, 
successor, addition, multiplication, equality). Let JT be the standard inter- 
pretation of L2 sd with the set of natural numbers as its domain and the 
usual interpretations of the non-logical constants. 


PROPOSITION A .4 

Let be any formula of \31<4. Then 3ft is true in Jf if and only if AR2 => 38 
is standardly valid. 

Proof 

Assume AR2 =^- 38 is standardly valid. So, AR2 ^ 3ft is true in Jf . But AR2 
is true in A r . Hence, 38 is true in Jf . Conversely, assume 38 is true in J r . We 
must show that AR2 =>38 is standardly valid. Assume that AR2 is true in 
some standard interpretation .3/ of L2,«/. By the categoricity of AR2, M is 
isomorphic to j\ r . Therefore, since 3ft is true in 3ft is true in Ji . Thus, 
AR2 => 38 is true in every standard interpretation of L2s3, that is, 
AR2 =>38 is standardly valid. 


(a) The set SV of standardly valid formulas of L2 is not effectively 
enumerable. 

(b) SY is not recursively enumerable, that is, the set of Godel numbers of 
formulas in SY is not recursively enumerable. 


Proof 

(a) Assume that SV is effectively enumerable. Then, by Proposition A4, we 
could effectively enumerate the set 3338. of all true formulas of first- 
order arithmetic by running through SV, finding all formulas of the 
form AR2 => 38, where 38 is a formula of first-order arithmetic, and 
listing those formulas 38. Then the theory 8338 would be decidable, 
since, for any closed formula < 3, we could effectively enumerate 3338. 
until either ( 3> or its negation appears. By Church’s thesis, 33 3ft would be 
recursively decidable, contradicting Corollary 3.46 (since 33 3ft- is a 
consistent extension of RR). 

(b) This follows from part (a) by Church’s thesis. 

^Details of the proof may be found in Mendelson (1973, Section 2.7). 
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The use of Church’s thesis in the proof could be avoided by a consistent 
use of recursion-theoretic language and results. The same technique as the 
one used in part (a), together with Tarski’s theorem (Corollary 3.44), would 
show the stronger result that the set (of Godel numbers) of the formulas in 
SY is not arithmetical. 


COROLLARY A.6 

The set of all standardly valid formulas is not effectively (or recursively) 
enumerable. 


Proof 

An enumeration of all standardly valid formulas would yield an enumera- 
tion of all standardly valid formulas of L2«s/, since the set of formulas of 
L2sJ is decidable (recursively decidable). 


COROLLARY A.7 

There is no axiomatic formal system whose theorems are the standardly 
valid formulas of L2^/. 


Proof 

If there were such an axiom system, we could enumerate the standardly valid 
formulas of L2stf, contradicting Corollary A.5. 


PROPOSITION A.8 (INCOMPLETENESS 
OF STANDARD SEMANTICS) 

There is no axiomatic formal system whose theorems are all standardly valid 
formulas. 


Proof 

If there were such an axiom system, we could enumerate the set of all 
standardly valid formulas, contradicting Corollary A.6. 

Proposition A.8 sharply distinguishes second-order logic from first-order 
logic, since Godel’s completeness theorem tells us that there is an axiomatic 
formal system whose theorems are all logically valid first-order formulas. 
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Here are some additional important properties enjoyed by first-order the- 
ories that do not hold for second-order theories. 

(I) Every consistent theory has a model. To see that this does not hold 
for second-order logic (with ‘model’ meaning ‘model in the sense of the 
standard semantics’), add to the theory AR2 a new individual constant b. 
Let ff be the theory obtained by adding to AR2 the set of axioms b^n for 
all natural number n. J is consistent. (Any proof involves a finite number of 
the axioms b ^n. AR2 plus any finite number of the axioms b ^ n has the 
standard interpretation as a model, with b interpreted as a suitable natural 
nmnber. So, every step of the proof would be true in Jf . Therefore, a 
contradiction cannot be proved.). But has no standard model. (If Ji were 
such a model, AR2 would be true in Ji. Hence, Ji would be isomorphic to 
Jf and so, the domain of Ji would consist of the objects denoted by the 
numerals h. But this contradicts the requirement that the domain of Ji 
would have to have an object denoted by l b > that would satisfy the axioms 
b for all natural numbers n.) 

(II) The compactness property: a set T of formulas has a model if and 
only if every finite subset of T has a model. A counterexample is furnished 
by the set of axioms of the theory in ( I ) above. 

(III) The upward Skolem-Lowenheim theorem: every theory that has an 
infinite model has models of every infinite cardinality. In second-order logic 
this fails for the theory AR2. By Proposition A. 3, all models of AR must be 
denumerable. 

(IV) The downward Skolem— Lowenheim theorem: every model Ji of a 
theory has a countable elementary submodel!. In second-order logic, a 
counterexample is furnished by the second-order categorical theory for the 
real number system.! Another argument can be given by the following 
considerations. We can express by the following second-order formula 
^(Y^X 1 ) the assertion that Y 1 is equinumerous with the power set of X 1 : 

(3R 2 )[(V*i)(Vx2)(X , (xi) A X 1 ^) A (VyXY^y) =► [R 2 (x u y) O 
R 2 (x 2 ,y)]) =► xi =x 2 ) A (VW')(W’ CY'^(3 x)(X* (x)A 
(Vy)(W'(y)oR 2 (xy))))] 

R 2 correlates with each x in X 1 the set of all y in Y 1 such that R 2 (x,y). Now 
consider the following sentence Cont: 

(3X')(3Y l )(Den(X 1 ) A(Vy)Y 1 (y) A^(Y',X')) 


'For a definition of elementary submodel , see Section 2.13. 

+The axioms are those for an ordered field (see p.99) plus a second-order 
completeness axiom. The latter can be taken to be the assertion that every nonempty 
subset that is bounded above has a least upper bound (or, equivalently, that no 
Dedekind cut is a gap). For a proof of categoricity, see Mendelson [1973], Section 
5.4. 
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Then Cont is true in a standard interpretation if and only if the domain of 
the interpretation has the power of the continuum, since the power set of a 
denumerable set has the power of the continuum. See Shapiro (1991, Section 
5.1.2) and Garland (1974) for more information about the definability of 
cardinal numbers in second-order logic. 

Exercises 

A.7 Show that a sentence of pure second-order logic is true in a standard 
interpretation Jt if and only if it is true in any other standard inter- 
pretation whose domain has the same cardinal number as that of Ji. 
A.8 (a) Show that there is a formula Cont (X 1 ) of pure second-order logic 

that is satisfied by an assignment in an interpretation if and only if 
the set assigned to X 1 has the power of the continuum. 

(b) Find a sentence CH of pure second-order logic that is standardly 
valid if and only if the continuum hypothesis is trued 


HENKIN SEMANTICS FOR L2C 


In light of the fact that completeness, compactness and the Skolem-Ldw- 
enheim theorems do not hold in second-order logic, it is of some interest 
that there is a modification of the semantics for second-order logic that 
removes those drawbacks and restores a completeness property. The fun- 
damental ideas sketched below are due to Henkin (1950). 

Start with a first-order interpretation with domain D. For each positive 
integer n, choose a fixed collection S)(n) of n- ary relations on D, and a fixed 
collection ^(n) of n- ary operations on D. Instead of YY , we now use the set 
of assignments s in Yh such that, for each predicate variable R", ^(R") 
is in <£>(n) and, for each function variable g", ,v(g") is in J F (n). The definitions 
of satisfaction and truth are the same as for standard semantics, except that 

Y^ is replaced by X) • Such an interpretation will be called a Henkin 

.2 2 ^ ^ 

interpretation. Using a Henkin interpretation amounts to restricting the 
ranges of the predicate and function variables. For example, the range of a 
predicate variable R 1 need not be the entire power set {?(D) of the domain 
D. In order for a Henkin interpretation to serve as an adequate semantic 
framework, we must require that all instances of the comprehension schema 
and the function definition schema are true in . A Henkin interpretation 


t\Ve take as the continuum hypothesis the assertion that every subset of the set 
of real numbers is either finite or denumerable or is equinumerous with the set of all 
real numbers. 
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for which this condition is met will be called a general model. A formula that 
is true in all general models will be said to be generally valid, and a formula 
that is satisfied by some assignment in some general model will be said to be 
generally satisfiable. We say that 88 generally implies if 88 =>■ ^ is generally 
valid, and that 88 is generally equivalent to ^ if 88 is generally valid. 

A standard interpretation on a domain D determines a corresponding 
general model in which 88 {n) is the set of all n- ary relations on D and 8F (n) is 
the set of all n- ary operations on D. Such a general model is called a full 
general model. Standard satisfaction and truth are equivalent to Henkin 
satisfaction and truth for the corresponding full general model. Hence, the 
following statements are obvious. 

PROPOSITION A.9 

(a) Every generally valid formula is also standardly valid. 

(b) Every standardly satisfiable formula is generally satisfiable. 

We also have the following strengthening of Proposition Al. 

PROPOSITION A.10 

Every theorem of PC2 is generally valid. 

Proof 

The general validity of (Comp) and (FunDef ) follows from the definition of 
a general model. The proofs for the other logical axioms are similar to those 
in the first-order case, as is the verification that general validity is preserved 
by the rules of inference. 


PROPOSITION A.ll (GENERAL SECOND-ORDER 
COMPLETENESS) 

The theorems of PC2 coincide with the generally valid formulas of L2C. 

Proof 

Let 88 be a generally valid formula of L2C. We must show that 88 is a 
theorem of PC2. (It suffices to consider only closed formulas.) Assume, for 
the sake of contradiction, that 88 is not a theorem of PC2. Then, by the 
deduction theorem, the theory PC2 -I- {-*88} is consistent. If we could prove 
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that any consistent extension of PC2 has a general model, then it would 
follow that PC2 + {->&} has a general model, contradicting our hypothesis 
that is generally valid. Hence, it suffices to establish the following result. 


HENKIN’S LEMMA 

Every consistent extension ST of PC2 has a general model. 


Proof 

The strategy is the same as in Henkin’s proof of the fact that every con- 
sistent first-order theory has a model. One first adds enough new individual 
constants, function letters and predicate letters to provide ‘witnesses’ for all 
existential sentences. For example, for each sentence (3 x)^’(jc) there will be a 
new individual constant b such that (3x)^(x) =4- (b) can be consistently 

added to the theory. (See Lemma 2.15 for the basic technique.) The same 
thing is done for existential quantifiers (3g") and (3R"). Let be the 
consistent extension of 2T obtained by adding all such conditionals as ax- 
ioms. Then, by the method of Lindenbaum’s lemma (Lemma 2.14), we 
inductively extend ?T* to a maximal consistent theory !T ^ . A general model 
Ji of & can be extracted from ST# . The domain consists of the constant 
terms of iT# . The range of the predicate variables consists of the relations 
determined by the predicate letters of . A predicate letter B determines 
the relation B # such that B#(t) n holds in Ji if and only if B#(t) n is a theorem 
of . The range of the function variables consists of the operations de- 
termined by the function letters of iF* . If f is a function letter of iT#, define 
an operation by letting f#((t)„) = A proof by induction shows 

that, for every sentence ^ is true in Ji if and only if ^ is a theorem of 
er*. In particular, all theorems of ZT are true in Ji. 

The compactness property and the Skolem-Lowenheim theorems also 
hold for general models. See Manzano (1996, Chapter IV), or Shapiro 
(1991) for detailed discussions.! 


COROLLARY A.12 

There are standardly valid formulas that are not generally valid. 


f Lindstrom (1969) has shown that, in a certain very precise sense, first-order 
logic is the strongest logic that satisfies the countable compactness and Skolem- 
Lowenheim theorems. So, general models really are disguised first-order models. 



APPENDIX 


381 


Proof 

By Corollary A.7, there is no axiomatic formal system whose theorems are 
the standardly valid formulas of \Jlsd. By Proposition A. 1 1 , the generally 
valid formulas of \2sA are the theorems of the second-order theory PsP2. 
Hence, the set of standardly valid formulas of lPlsA is different from the set 
of generally valid formulas of \P2sA . Since all generally valid formulas are 
standardly valid, there must be some standardly valid formula that is not 
generally valid. 

We can exhibit an explicit sentence that is standardly valid but not 
generally valid. The Godel-Rosser incompleteness theorem (Proposition 
3.38) can be proved for the second-order theory AR2. Let 01 be Rosser’s 
undecidable sentence for AR2d If AR2 is consistent, PA. is true in the stan- 
dard model of arithmetic. (Recall that PA, asserts that, for any proof in AR2 
of 0i, there is a proof in AR2, with a smaller Godel number, of -'PA.. If AR2 
is consistent, PA is undecidable in AR2 and, therefore, there is no proof in 
AR2 of 0t , which makes PA. trivially true.) Hence, AR2 ^ PA is standardly 
valid, by Proposition A.4. However, AR2 => PA is not generally valid. For, if 
AR2 =P PA. were generally valid, it would be provable in Pj</ 2, by Proposi- 
tion A. 1 1 . Hence, PA, would be provable in AR2, contradicting the fact that it 
is an undecidable sentence of AR2. 


Exercise 

A.9 (a) Show that the second-order theory AR2 is recursively undecid- 
able. 

(b) Show that the pure second-order predicate calculus V$42 is re- 
cursively undecidable. + 

It appears that second-order and higher-order logics were the implicitly 
understood logics of mathematics until the 1920s. The axiomatic char- 
acterization of the natural numbers by Dedekind and Peano, the axiomatic 
characterization of the real numbers as a complete ordered field by Hilbert 
in 1900, and Hilbert’s axiomatization of Euclidean geometry in 1902 (in the 
French translation of his original 1899 book) all presupposed a second- 
order logic in order to obtain the desired categoricity. The distinction be- 
tween first-order and second-order languages was made by Lowenheim 
(1915) and by Hilbert in unpublished 1917 lectures, and was crystal-clear in 


tWe must assume that AR is consistent. 

iThe pure second-order monadic predicate logic MP2 (in which there are no 
nonlogical constants and no function variables, and all second-order predicate 
variables are monadic) is recursively decidable. See Ackermann (1954) for a proof. 
The earliest proof was found by Lowenheim (1915), and simpler proofs were given 
by Skolem (1919) and Behmann (1922). 
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Hilbert and Ackermann’s (1950),* where the problem was posed about the 
completeness of their axiom system for first-order logic. The positive solu- 
tion to this problem presented in Godel (1930), and the compactness and 
Skolem -Lowenheim theorems that followed therefrom, probably made the 
use of first-order logic more attractive. Another strong point favoring first- 
order logic was the fact that Skolem in 1922 constructed a first-order system 
for axiomatic set theory that overcame the imprecision in the Zermelo and 
Fraenkel systems.* Skolem was always an advocate of first-order logic, 
perhaps because it yielded the relativity of mathematical notions that Sko- 
lem believed in. Philosophical support for first-order logic came from W.V. 
Quine, who championed the position that logic is first-order logic, and that 
second-order logic is just set theory in disguise. 

The rich lodes of first-order model theory and proof theory kept logicians 
busy and satisfied for over a half-century, but recent years have seen a 
revival of interest in higher-order logic and other alternatives to first-order 
logic, and the papers in the book Model-Theoretic Logics (edited by Barwise 
and Feferman (1985)) offer a picture of these new developments. 6 Barwise 
(1985) lays down the challenge to the old first-order orthodoxy, and Shapiro 
(1991) and Corcoran (1987) provide philosophical, historical and technical 
support for higher-order logic. Of course, we need not choose between first- 
order and higher-order logic; there is plenty of room for both. 


* Hilbert and Ackermann (1950) is a translation of the second (1938) edition of a 
book which was first published in 1928 as Grundziige der theoretischen Logik. 

+See Moore (1988) and Shapiro (1991) for more about the history of first-order 
logic. Shapiro (1991) is the most reliable and thorough study of the controversies 
involving first-order and second-order logic. 

®Van Benthem and Doets (1983) also provides a high-level survey of second- 
order logic and its ramifications. 


CHAPTER 1 
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1.1 A B 

T T F 

F T T 

TFT 
F F F 

1.2 A B ~\A A^B (A B)\J ~\A 

T T F T T 

F T T T T 

T F F F F 

F F T T T 

1.3 ((A=>B)AA) 

f j * r | i f| t i * w j * 

FTT FF 
T F F F T 
FTF FF 

1.4 (a) ((A => (-*)) A (M) =► ^B))) 

(c) (A => B), A: x is prime, B: x is odd. 

(d) (A => B), A: the sequence s converges, 

B: the sequence s is bounded. 

(e) (A & (BA (CAD))) 

A: the sheikh is happy, 

B: the sheikh has wine, 

C : the sheikh has women, 

D : the sheikh has song. 

(f) (A =>B), A: Fiorello goes to the movies. 

(i) ((~>A) =>■ B ), A: Kasparov wins today, 

B: Karpov will win the tournament. 

1.5 (c), (d), (f), (g), (i), (j) are tautologies. 
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ANSWERS TO SELECTED EXERCISES 


1.6 (a), (b), (d), (e), (f) are logically equivalent pairs. 

1.11 All except (i). 

1.13 Only (c) and (e). 

1.15 (a) (B => ^A) A C (e) A B ^ -.(C V D) 

(c) Drop all parentheses, (g) ->(-i->(B V C) ^ (B Q) 

1.16 (a) (CV(M)AB)) (c) ((C=^((-^((AVB)^C))AA))^B) 

1.17 (a) (((-i(-i^)) (PV C)) (d) and (f ) are the only ones that are 

not abbreviations of statement forms. 

1.18 (a) V => C~>AB and VC=^ A B—<DC 

(c) (a) A B^AC (b) V A V BC 

(d) (i) is not. (ii) (A => B) {{B =* C) ^ (-. A =4> C)) 

1.19 (f) is contradictory, and (a), (d), (e), (g)-(j) are tautologies. 

1.20 (b)-(d) are false. 

1.21 (a) T (b) T (c) indeterminate 

1.22 (a) A is T, B is F, and ->A V (A =>■ B) is F. 

(c) A is T, C is T, B is T. 

1.29 (c) (i) A a((£aC) V a -q) (ii) A /\B t\ ->C 

(iii) -A V (-.5 A C) 

1.30 (a) If & is a tautology, the result of replacing all statement letters by 

their negations is a ; tautology, if we then move all negation signs 
outward by using Exercise 1 .27 (k) and (1), the resulting tautology 
is -i^. Conversely, if is a tautology, let ^ be ->{%'. By the first 
part, is a tautology. But is 
(c) (— i A A —iB A —i iC) V (A AB A -tD) 

1.32 (a) For figure 1.4: 




~i B 


1.33 (a), (d) and (h) are not correct, 

1.34 (a) Satisfiable: Let A, B, and C be F, and let D be T. 

1.36 For /, 

(4 A 2? A C) V (~iA AB AC) V (A A ~<B A C) V (~>A A ~>B A ^C) 

1.37 For =$■ and V, notice that any statement form built up using => and V 
will always take the value T when the statement letters in it are T. In the case 
of -i and <=>, using only the statement letters A and B, find all the truth 
functions of two variables that can be generated by applying -> and any 
number of times. 

1.40 (a) 2 4 = 16 (b) 2 2 " 
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1.41 h(C, C, C) = -.C and h(B, B , -.C) is £ ^ C. 

1.42 (b) For ->(A => 5) V (-u4 A C), a disjunctive normal form is (A A -i2?)V 

(-i A A C), and a conjunctive normal form is (A V C) A (-ii? V — ) A (-ii? V C). 
(c) (i) For (A A B) V ->A, a full dnf is (A A B) V (~>A A B) V (-vf A -7?), and 

a full cnf is 5 V ->A . 

1.43 (b) (i) Yes. A: T, B: T, C: F (ii) Yes. A: T, B: F, C : T 

1.45 (b) A conjunction £ of the form B\ A. .. A B*, where each B* is either 
Bj or -i Bi, is said to be eligible if some assignment of truth values to the 
statement letters of 88 that makes 88 true also makes £ true. Let Y be the 


disjunction of all eligible conjunctions. 

1.47 (b) 

2.88^<£ 

3. {& =>(&=> $)) => ((a => <&) 
=► {88 => &)) 

4. {<£ =► @) => {88 => {<£ => &)) 

5 . 88 =► {<£ =* &) 

6 . {88 =* <£) => {88 => @) 

7 . 88 =*> 9 

1.48(a) l. 8=>-nB 

2. — i - 1 AB =r~ {—>B =$■ 

3. 88 =*► {^88 => <#) 

4. m => \m v <£) 

(c) 1. —f£ => 88 

2 . {-f£ =* 88) => (-,& => 

3. ->& => 

4. => <£ 

5. ^88^<£ 

6 . ~ f £ =^88 I — '88 =£■ *£ 


Hypothesis 
Hypothesis 
Axiom (A2) 

Axiom (Al) 

1, 4, MP 
3, 5, MP 

2, 6, MP 
Lemma 1 . 11 (b) 
Lemma 1.11(c) 

1, 2, Corollary 1.10(a) 

3, Abbreviation 
Hypothesis 
Lemma 1.11(e) 

1, 2, MP 
Lemma 1.11(a) 

3, 4, Corollary 1.10(a) 
1-5 


7. h {-A£ =^- 88) => (-i 88 ^) 6, deduction theorem 

8 . \~ {<£ \J 88) => {88 V ( €) 7, abbreviation 

1.50 Take any assignment of truth values to the statement letters of 88 that 
makes 88 false. Replace in 88 each letter having the value T by A i V ~A \ , and 
each letter having the value F by A \ A ->A i . Call the resulting statement form 
< 8. Thus, ^ is an axiom of L*, and , therefore, El* < 8- Observe that always 
has the value F for any truth assignment. Hence, -88 is a tautology. So 
El and, therefore, El* 

1.51 (Deborah Moll) Use two truth values. Let => have its usual table and 
let -i be interpreted as the constant function F. When B is F, (-i B => -i A) => 
((-iB =>A)^B) is F. 

1.52 The theorems of P are the same as the axioms. Assume that P is 


suitable for some ^-valued logic. Then, for all values k, k * k will be a 
designated value. Consider the sequence of formulas 88q = A, 88j + \ — A * 88y 
Since there are n" possible truth functions of one variable, among 
88 $, . . . , 88 ,f there must be two different formulas 88 j and 88 ^ that determine 
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ANSWERS TO SELECTED EXERCISES 


the same truth function. Hence, 88 j * 88k will be an exceptional formula that 
is not a theorem. 

1.53 Take as axioms all exceptional formulas, and the identity function as 
the only rule of inference. 


CHAPTER 2 

2.1 (a) ((Vxi)(i4}(jci) A (“U!(* 2 )))) (6) (((Vx 2 )/1|(x2)) ^/4!(x 2 )) 

(d) (((Vxi)((Vx 3 )((Vx 4 )/4|(x 1 )))) =4> (A\(x 2 ) A (-^4 J (*i ) ) ) ) 

2.2 (a) ((Vxj ){a\ (xj ) =>- X|(xi))) V 

2.3 (a) The only free occurrence of a variable is that of x 2 . 

(b) The first occurrence of x 3 is free, as is the last occurrence of x 2 . 
2.6 Yes, in parts (a), (c) and (e) 

2.8 (a) (Vjc)(P(jc) =>£(*)) 

(b) (Vx)(P(x) =4- ~IH(x)) or ~|(3 x)(P(jc) A H(x)) 

(c) "l(Vx)(i?(*)^F(*)) 

(d) (V*)(£(x) => ->F(x)) (e) T(x) => I(x) 

(f) (Vx)(Vy)(S(x) A D(x,y) =>J(y)) 

(j) (Vx)(^H(x,x) => H{j,x)) or (' Vx) (P(x) A -.#(*, x) => H{j,x )) 

(In the second wf, we have specified that John hates those persons 
who do not hate themselves, where P(x) means x is a person.) 

2.9 (a) All bachelors are unhappy, (c) There is no greatest integer. 

2.10 (a) (i) is satisfied by all pairs (x 1 ,x 2 ) of positive integers such that 

x\ • x 2 > 2. 

(ii) is satisfied by all pairs (x\,x 2 ) of positive integers such that 
either x\ < x 2 (when the antecedent is false) or x\ = x 2 (when 
the antecedent and consequent are both true). 

(iii) is true. 

2.11 (a) Between any two real numbers there is a rational number. 

2.12 (I) A sequence s satisfies ~\88 if and only if s does not satisfy 88. 

Hence, all sequences satisfy "1 88 if and only if no sequence satisfies 
88\ that is, ~\88 is true if and only if 88 is false. 

(II) There is at least one sequence s in If s satisfies 88, 88 cannot be 
false for M. If s does not satisfy 88, 88 cannot be true for M. 

(III) If a sequence s satisfies both 88 and 88 =y %>, then 5 satisfies by 
condition 3 of the definition. 

(V) (a) s satisfies 88 A if and only if s satisfies “| (88 => “]#) 
if and only if 5 does not satisfy 88 ==> “l^ 

if and only if s satisfies 88 but not “1 ^ 
if and only if s satisfies 88 and s satisfies ^ 

(VI) (a) Assume |=m 88. Then every sequence satisfies 88. In particular, 
every sequence that differs from a sequence s in at most the ith 
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place satisfies 38. So, every sequence satisfies (Vx,)^; that is, 

|=M 

(b) Assume |=m {f]xj)38. If s is a sequence, then any sequence that 
differs from s in at most the ith place satisfies 38, and, in particular, 
s satisfies 38. Then every sequence satisfies 38; that is, |=m 38. 

(VIII) Lemma. If all the variables in a term t occur in the list x,-, , . . . , x ik 
(i k > 0; when k = 0, t has no variables), and if the sequences s 
and s' have the same components in the qth, . . . , 4-th places, 
then s*(t) = (s')*(t). 

Proof. Induction on the number m of function letter in t. Assume 
the result holds for all integers less than m. 

Case 1. t is an individual constant a p . Then s*(a p ) = (a p ) 

= O0*(o p ). 

Case 2. t is a variable x,-.. Then s* (x, . ) = s-,. = s' r = (/)*(x,-.). 

Case 3. t is of the form fj‘(t \, . . . , t n ). For g^n, each t q has fewer 

J * 

than m function letters and all its variables occur among 
j , . . . , at By inductive hypothesis, s*(t q ) = (s')*(t q ). Then 

= (yj') M 0*0i),...,^0„)) = (Jjr (M*(< 1) 

Proof of (VIII). Induction on the number r of connectives and 
quantifiers in 38. Assume the result holds for all q < r. 

Case 1. 38 is of the form A f j(t that is, r = 0. All the 
variables of each occur among x,-, , . . . , x IJt . Hence, by the 
lemma, s*(tj) — ( s')*(tj ). But s satisfies A"(ti if and only 

if (s*(t \), . . . is in that is, if and only if 

{(/)*(?i), (/)*(?„)) is in (A'j ) M , which is equivalent to s' 
satisfying A"(ti, . . . ,t„). 

Case 2. 38 is of the form "1 ( 3. 

Case 3. 38 is of the form <3 => 32. Both cases 2 and 3 are easy. 
Case 4. 38 is of the form (Vx y )^. The free variables of occur 
among x,-,, . . . ,x ik and x y . Assume s satisfies 38. Then every se- 
quence that differs from s in at most the jth place satisfies < 3. Let 
s& be any sequence that differs from s' in at most the fth place. 
Let s b be a sequence that has the same components as s in all but 
the yth place, where it has the same component as s#. Hence, s b 
satisfies <3. Since s h and s# agree in the /‘jth,. . . , i k th and jth 
places, it follows by inductive hypothesis that s b satisfies ( 8 if 
and only if s# satisfies < 3. Hence, s# satisfies < 3. Thus, s' satisfies 
38. By symmetry, the converse also holds. 

( IX) Assume 38 is closed. By ( VIII), for any sequence s and s', s satisfies 
38 if and only if s' satisfies 38. If -i 38 is not true for M, some 
sequence s' does not satisfy ->38\ that is, s’ satisfies 38. Hence, every 
sequence s satisfies 38; that is, |=m 38. 
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(X) Proof of Lemma 1: induction on the number m of function letters 
in t. 

Case 1. t is aj. Then t’ is u y -. Hence, 


s’( 0 = **(«/) = (aj) M = W)’(aj) = M*(0 


Case 2. t is xj, where j f i. Then t ' is Xj. By the lemma of ( V1IT), 
s*(t') = (s')*(t), since s and s' have the same component in the 
y'th place. 

Case 3. t is Xj. Then t' is u. Hence, s*(t') = £*(*/), while (s')*(t) = 
(s')* (Xj) = s^ = s*(u). 

Case 4. t is of the form fjfti, . . . ,t„). For \^q^n, let t' result 
from t q by the substitution of u for x,. By inductive hypothesis 

wi) = wm- But 




• ,'„')) = (. /T)V(0.- 



= = wruj(t, ,■■■,!.)) = 

Proof of Lemma 2(a): induction on the number m of connectives 
and quantifiers in 43 (xj). 

Case 1. m — 0. Then 43 (x^ is A”(t\ } . . . , t„). Let t ' be the result of 
substituting t for all occurrences of x- t in t q . Thus, 43 (t) is 
Aj(t (, . . . , t'). By Lemma 1, s*(t q ) = ( s'Y(t q ). Now, s satisfies 
43(t) if and only if (s*(t {), . . . ,^*(^ / )) belongs to (A’)) M , which is 
equivalent to • • • > ( s ')*(bi)) belonging to (A'j) M - that 

is, to s' satisfying 43 (xf). 

Case 2. 43(xj) is -> #(*,•); this is straightforward. 

Case 3. 43(xj) is t tf(x i )=>@(xi); this is straightforward. 

Case 4. 43(xi) is ('4xj)43(x i ). 

Case 4a. xj is Xj. Then Xj is not free in 43 (x /), and 43(f) is 43 (x,-). 
Since x ,■ is not free in 43 (x t ), it follows by (VIII) that 5 satisfies 
43(t) if and only if s' satisfies 43 (xj). 

Case 4b. xj is different from x,. Since t is free for x { in 43 (x,), t is also 
free for Xj in ^(x,). 

Assume s satisfies (Vxy)^(t). We must show that s' satisfies 
(Vxy) ( <4(xi). Let s# differ from s' in at most the y'th place. It 
suffices to show that s# satisfies c 4(xj). Let s b be the same as s# 
except that it has the same ith component as s. Hence, s b is the 
same as s except in its /th component. Since 5 satisfies (Vx y )^( /), 
s b satisfies ^(t). Now, since t is free for x t in (Vx y )^(x/), t does 
not contain xj. (The other possibility, that x t is not free in ^(x,-), 
is handled as in case 4a.) Hence, by the lemma of (VIII), 
(s b )*(t) = s*(t). Hence, by the inductive hypothesis and the fact 
that s# is obtained from s b by substituting (s b )*(t) for the ith 
component of s b , it follows that s# satisfies ^(x 7 ), if and only if 
s b satisfies b 4(t). Since s b satisfies r 4(t), s# satisfies (xf ) . 
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Conversely, assume s' satisfies (Vxy)^(x/). Let s h differ from s 
in at most the /'th place. Let s# be the same as s' except in the y'th 
place, where it is the same as s b . Then s# satisfies ^(x,). As 
above, .$*(/) = (,s 6 )*(f). Hence, by the inductive hypothesis, s b 
satisfies %>(t) if and only if s# satisfies (x,- ) . Since s# satisfies 
< &(xi),s b satisfies ^(t). Therefore, s satisfies (Vx y -)#( 7 ). 

Proof of Lemma 2(b). Assume s satisfies (Vx,)^(x/). We must 
show that s satisfies 36 (t). Let s' arise from s by substituting s*(t) 
for the /th component of s. Since s satisfies (Vx/)^?(x,) and s' 
differs from 5 in at most the /th place, s' satisfies 36 (x,-). By 
Lemma 2(a), s satisfies 36 (t). 

2.13 Assume 36 is satisfied by a sequence s. Let s' be any sequence. By (VIII), 
s' also satisfies 36. Hence, 36 is satisfied by all sequences; that is, |=m 36. 

2.14 (a) x is a common divisor of y and z. (d) xj is a bachelor. 

2.15 (a) (i) Every non-negative integer is even or odd. True. 

(ii) If the product of two non-negative integers is zero, at least one 
of them is zero. True, (iii) 1 is even. False. 

2.17 (a) Consider an interpretation with the set of integers as its domain. 

Let A\(x) mean that x is even and let A\(x) mean that x is odd. 
Then (Vxi)/4}(xi) is false, and so (Vxi)yf] (xi)=^(Vxi)/l 2 (*i) is true. 
However, (Vxi)(yfJ (xj )=M 2 (xi)) is false, since it asserts that al] 
even integers are odd. 

2.18 (a) [(\/x/)->^ , (x l )=^-i^(/)]=^[^(/)=^-i(Vx/)-i^’(x f )] is logically valid 

because it is an instance of the tautology (A=y-iB)=>(B=>-<A). 
By (X), (\fxi)-i36(xi)=y->36(t) is logically valid. Hence, by (III), 
^(/)=^-i(Vx/) -1 36{xi) is logically valid. 

(b) Intuitive proof: If 36 is true for all x,-, then 36 is true for some x,-. 
Rigorous proof: Assume ( Vx, ) ( 3x ,- ) 36 is not logically valid. 

Then there is an interpretation M for which it is not true. Hence, 
there is a sequence s in such that s satisfies (6x2)36 and s does 
not satisfy -i(6xj)-i36. From the latter, s satisfies (6xj)->36. Since s 
satisfies (6x2)36, s satisfies 36, and, since s satisfies (6xi)->36, s sat- 
isfies - 1 36. But then s satisfies both 36 and - 1 36, which is impossible. 

2.19 (b) Take the domain to be the set of integers and \etA\(i2) mean that u 

is even. A sequence s in which si is even satisfies A\(xi) but does 
not satisfy (Vxi)/f|(xi). 

2.21 (a) Let the domain be the set of integers and let Aj(x,y) mean that 

x<y. (b) Same interpretation as in (a). 

2.22 (a) The premisses are (i) (6x)(S(x)=>N(x)) and (ii) (Vx)(F(x)=^ 

—>M (x)), and the conclusion is (Vx) ( V {x)=^>^S (x)) . Intuitive proof: 
Assume V(x). By (ii), ~>N (x) . By (i), ->S (x) . Thus, ~>S (x) follows 
from F(x), and the conclusion holds. A more rigorous proof can 
be given along the lines of ( I)-( XI), but a better proof will become 
available after the study of predicate calculi. 
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2.26 (a) (3x)(3y)(i4j(x) A ->A\ (y)) 

2.27 (a) 1. (Vx) (88=y<g) 

2. ( Vx) 83 

3. (\/x)(88=><$)=>(88=><8) 

4. 83=><$ 

5. (\/x)83=>83 

6 . 83 

1 . 


Hyp 

Hyp 

Axiom (A4) 

1,3, MP 

Axiom (A4) 

2.5, MP 

4.6, MP 


8. (V*)# 7, Gen 

9. (V*) (#=*#), (Vx)<0 b (Vx)tf l 8 

10. (Vx) (#=*#) b (Vx)^=y(Vx)^ 1-9, Corrollary 2.6 

11. b (Vx)(^=>^)=^((Vx)^=^(Vx)^) 1-10, Corollary 2.6 

2.28 Assume b^ By induction on the number of steps in the 

proof of 88 in K, prove that, for any variables y \ , . . . , y„ (n > 0) , 
b K ^ (Vyi) . . . (Vy n )83. 

2.31 (a) 1. (Vx) (Vy)A \ (x, y) Hyp 

2. (^y)A \ (x, y) 1, Rule A4 

3. ^4j(x,x) 2, Rule A4 

4. (Vx)y4j(x,x) 3, Gen 

5. (Vx)(Vy/4j(x,y) I- (Vx/4j(x,x) 1-4 

6. b (Vx)(Vy)Xi (x,_y)=y(Vx)/4i(x,x) 1—5, Corollary 2.6 

2.33 (a) I — '(Vx)-i-i 83^=^-,(\/x)->83 by the replacement theorem and the fact 
that I — '- 1 83<=$>83. Replace -i(Vx)-i-i 83 by its abbreviation (3x)->83. 

2.36 (b) (3 e)(c > 0 A (V<5)(<5 > 0=^(3 x)(|x - c\ < 3 A -.|/(x) -/(c) | < e ))) 

2.37 (a) (i) Assume I- 83. By moving the negation step-by-step inward to 

the atomic wfs, show that I — ■ 83*^^ ( 8, where ^ is obtained 


from 83 by replacing all atomic wfs by their negations. But, 
from b 83 it can be shown that I- < 8. Hence, I — 1 83*. The converse 
follows by noting that (83*)* is 83. 

(ii) Apply (i) to -,83 V <ti. 

2.39 1. (3y) (Vx) (A? (x, y)<=^>-iA?(x, x) ) Hyp 

2. (Vx)(y4f(x,6)^-v4f(x,x)) 1, Rule C 

3. A 2 Ab,y)<=^-,A\(b, b) 2, Rule A4 

4. V A -M 3, Tautology 

is any wf not containing b.) Use Proposition 2.10 and proof by con- 
tradiction. 


2.46 (a) In step 4, b is not a new individual constant. It was already used in 

step 2. 

2.49 Assume K is complete and let 83 and ^ be closed wfs of K such that 
b k 88 V *$. Assume not-bx 83. Then, by completeness, bK -,83. Hence, by the 
tautology - 1 A=>((A V 83)=>83), b K 83. Conversely, assume K is not com- 
plete. Then there is a sentence 83 of K such that not-b K 83 and not-b K -,83. 
However, bK 83 V -,83. 

2.50 See Tarski, Mostowski and Robinson (1953, pp. 15-16). 
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2.55 (b) It suffices to assume 38 is a closed wf. (Otherwise, look at the 

closure of 38.) We can effectively write all the interpretations on a 
finite domain {b\,..., b^}. (We need only specify the interpreta- 
tions of the symbols that occur in 38.) For every such interpre- 
tation, replace every wf (Vx)#(x), where ^(x) has no quantifiers, 
by ^{b\) A ... A ^( bk ), and continue until no quantifiers are left. 
One can then evaluate the truth of the resulting wf for the given 
interpretation. 

2.59 Assume K is not finitely axiomatizable. Let the axioms of K\ be 

3ft \, 3ft 2 and let the axioms of K 2 be ^ 1 ,^ 2 , Then 

{3ft j, ^ 1 ,-^ 2 , ^ 2 , ■ ■ •} is consistent. (If not, some finite subset {3ft 1 , 
3ft2> • • - , 38k, ^ 1 , • • • ) is inconsistent. Since Kj is not finitely axiomatiz- 
able, there is a theorem 3ft of Ki such that 3ft \ , 3ft 2 ^ . . . , 3ft ^ b 3ft does not hold. 
Hence, the theory with axioms {3ft \ , 3ft 2 , . . . , 3ftu, 38 } has a model M. Since 
l-K 38, M must be a model of K 2 , and, therefore, M is a model of 
{3ft 1 , ^ 2 , . • • , 38k, ^ 1 , • • • , m }, contradicting the inconsistency of this set of 
wfs.) Since {3ft\ , , 382, ^ 2 , . . .} is consistent, it has a model, which must be 

a model of both K\ and K 2 . 

2.60 Hint. Let the closures of the axioms of K be 38\,3ft 2 , Choose a 

subsequence 38 j , , 3ft j 2 , . . . such that 3ftj n+l is the first sentence (if any) after 38 j n 
that is not deducible from 3ftj t A ... A 3ft j n . Let ^ be ^j\ 3 38 j 2 A • • - A 38 j t . 
Then the ^s form an axiom set for the theorems of K such that 
(- but not-b k=> < & k+i . Then {^ 1 , ^ 1 =^ 2 , ^ 2 =^ 3 , . . .} is an 

independent axiomatization of K. 

2.61 Assume 38 is not logically valid. Then the closure # of 38 is not logi- 
cally valid. Hence, the theory K with -33 as its only proper axiom has a 
model. By the Skolem-Lowenheim theorem, K has a denumerable model 
and, by the lemma in the proof of Corollary 2.22, K has a model of car- 
dinality m. Hence, ^ is false in this model and, therefore, 3ft is not true in 
some model of cardinality m. 

2.65 (c) x = x Proposition 2.23(a) 

2. (3y)x = y l, rule E4 

3. (Vx)(3y)x — y 2, Gen 

2.68 (a) The problem obviously reduces to the case of substitution for a 

single variable at a time: x\ — y\=^t{x\) = t(j ; i). From (A7), 
h xi = yi=^(t(xi) = ^(xi)=^t(xi) = t(yi)). By Proposition 2.23 
(a), b f(*i) = t(xi). Hence, bxj = y{=>t(x\) = t(yi). 

2.70 (a) By Exercise 2.65(c), b (3y)x = y. By Proposition 2.23(b,c), 

b (Vy)(Vz)(x = y A x = z=^y = z). Hence, b (3iy)x = y. By Gen, 
b (Vx)(3iy)x = y. 

2.71 (b) (i) Let Ai / xj stand for the conjunction of all wfs of the 

form Xj ^ Xj, where U / <;<«. Let 3ft n be (3xi) . . . (3x„) 

Ai ^i<j^ n Xi 7 ^ Xj. 
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(ii) Assume there is a theory with axioms stf \ , . . . , s$ n that has the 
same theorems as K. Each of stf \ , . . . , srf n is provable from K\ 

plus a finite number of the wfs ^ 1 ,^ 2 , Hence, K\ plus a 

finite number of wfs , . . . , @lj tt suffices to prove all theorems 
of K. We may assume j\ < ■ •■ <j„. Then an interpretation 
whose domain consists of j„ objects would be a model of K, 
contradicting the fact that & Jn +\ is an axiom of K. 

2.74 For the independence of axioms (Al) - (A3), replace all t = s by the 
statement form A => A; then erase all quantifiers, terms and associated 
commas and parentheses; axioms (A4) - (A 6 ) go over into statement forms 
of the form P => P, and axiom (A7) into (P P) => (Q => Q). For the 
independence of axiom (Al), the following four-valued logic, due to Dr 
D.K. Roy, works, where 0 is the only designated value. 

A B A^B A B A^B A B A^B A B A^B A ~\A 

000 100 200 300 01 

011 11 0 210 31 1 10 

02 1 12 0 22 0 32 1 20 

03 1 13 0 23 0 33 0 30 

When A and B take the values 3 and 0, respectively, axiom (Al) takes the 
value 1. For the independence of axiom (A 2 ), Dr Roy devised the following 
four-valued logic, where 0 is the only designated value. 

A B A => B A B A => B A B A => B A B A^B A ~iA 

000 10 0 20 0 30 0 01 

01 1 1 1 0 210 310 10 

02 l 12 0 220 32 1 20 

03 L 13 0 23 0 33 0 30 

If A, B and C take the values 3, 0 and 2 respectively, then axiom (A2) is 1 . 
For the independence of axiom (A3), the proof on page 44 works. For 
axiom (A4), replace all universal quantifiers by existential quantifiers. For 
axiom (A5), change all terms t to x\ and replace all universal quantifiers by 
( Vxi). For axiom (A 6 ), replace all wfs t = s by the negation of some fixed 
theorem. For axiom (A7), consider an interpretation in which the inter- 
pretation of = is a reflexive non-symmetric relation. 

2.83 (a) (Vx)(3j)((3z)(^(z,x,^,...,j;) AA\(x,y,z)) =>• (3z)(^(z, y,x , . . . ,x) 

A z = x)) 

2.84 (a) (3z)(Vw)(3x)([/4j(x) ^ A]{x,y)) => [4j(w) => A^(y,z)]) 

2.87 6? has the form (3x)(3y)(Vz)([A^(x,y) => ^4{(jc)] => A\{z)). Let the 
domain D be (1,2), let A^ be < , and let A\ (w) stand for u = 2. Then £? is 
true, but (Vx)(3 y)A^(x,y) is false. 

2.88 Let g be a one-one correspondence between D* and D. Define: 
( a j) M = 0((tf/) M ); ( fj) M (b u ...,b„)=g 1 [(f?) M (g(bi ), . . . , g(b n ))]; 

|=m* if and only if (= M ^[gfa ), . . . , 0 ( 6 *)]. 
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2.95 Hint : Extend K by adding axioms 8ft n , where 8ft n asserts that there are 
at least n elements. The new theory has no finite models. 

2.96 (a) Hint : Consider the wfs 8ft n , where 8ft n asserts that there are at least 

n elements. Use elimination of quantifiers, treating the 8ft „s as if 
they were atomic wfs. 

2.101 Let W be any set. For each b in W, let at be an individual constant. 
Let the theory K have as its proper axioms: at ^ a c for all b, c in W such 
that b^c, plus the axioms for a total order. K is consistent, since any finite 
subset of its axioms has a model. (Any such finite subset contains only a 
finite number of individual constants. One can define a total order on any 
finite set B by using the one-one correspondence between B and a set 
{1, 2, 3 and carrying over to B the total order < on {1, 2, 3, ... , «}.) 
Since K is consistent, K has a model M by the generalized completeness 
theorem. The domain D of M is totally ordered by the relation < M ; hence, 
the subset D w of D consisting of the objects (at) M is totally ordered by < M . 
This total ordering of D w can then be carried over to a total ordering of 
W : b < w c if and only if ab < M a c . 

2.103 Assume M] is finite and Mi = M 2 . Let the domain D\ of Mj have n 
elements. Then, since the assertion that a model has exactly n elements can 
be written as a sentence, the domain Dj, of M 2 must also have n elements. 
Let D\ = {hi , . . . , b n } and D 2 = {ci , . . . , c„}. 

Assume Mi and M 2 are not isomorphic. Let (p be any one of the n\ one-one 
correspondences between D\ and Dj. Since <p is not an isomorphism, either: 
( 1 ) there is an individual constant a and an element bj of D\ such that either 
(i) bj = a Ml A <p(bj ) 7 ^ a Ml or (ii) bj 7 ^ a M] A (p(bj) = a Mz ; or (2) there is a 
function letter f[ n and be , bj t , . . . , bj m in D\ such that 

b t = if t T' (K . • ■ • . K) “d <p(b ,) * UT) M '(<f(b h ) cp(bjJ) 

or (3) there is a predicate letter A™ and bj x , . . . , b Jm in D\ such that either 

(i) (=m, A n k [Ay, , • • ■ , b jn ] and |= Ma ~\A%[(p(bj } <p(b Jm )] or 

(ii) |= M| ^ A k[ h h > ■ • • > b j„] and I=m 2 Ap^bjX • • • , Construct a wf 

8ft (p as follows: 

' Xj = a if ( 1 ) (i) holds 

Xj 7 ^ a if ( 1 ) (ii) holds 

8ft < P is < xe= f™ (x j] t..,XjJ if ( 2 ) holds 

ft’k ixji if (3) (i) holds 

l “I Afixfi ,...,x Jm ) if (3) (ii) holds 
Let (pi , . . . , (p H 1 be the one-one conespondences between D\ and Dj- 
Let srf be the wf 




Xj A Xj A 8ft 


A 8ft ^ A 


Then is true for Mj but not for M 2 . 



« • « 
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2.104 (a) There are sentences in the language of K. Hence, there are 

sets of sentences. If M) = M 2 does not hold, then the set of sen- 
tences true for M] is different from the set of sentences true for 
M 2 . 

2.105 Let K* be the theory with N } , new symbols b x and, as axioms, all 
sentences true for M and all b x b p for t 7 - p. Prove K* consistent and 
apply Corollary 2.34. 

2.108 (a) Let M be the field of rational numbers and let X = {—1}. 

2.110 Consider the wf ( 3 x 2 ) *2 < * 1 * 

2.111 (a) (ii) Introduce a new individual constant b and form a new theory 

by adding to the complete diagram of Mi all the sentences 
b t for all closed terms t of the language of K. 

2.112 If 0 ^ $F g?(A). Conversely, if 0 G 3F , then, by clause (3) of 
the definition of filter, #” = FA (A). 

2.113 If BF — !F b , then Dce^ C = B E tF . Conversely, if B = f)ce& C G 
then 3F — F'b. 

2.114 Use Exercise 2.113. 

2.115 (a) A G FF , since A = A — 0. 

(b) If B = A — W\ G !F and C = A — Wi G where W\ and W 2 are 
finite, then BnC — A — (W\ u Wfi) G $F, since WikjWj is finite. 

(c) If B — A — W G where W is finite, and if B CC, then 
C = A — {W — C) G , since W — C is finite. 

. (d) Let B C C. So, B — A — W, where W is finite. Let b G B. Then 
WKj{b) is finite. Hence, C = A — (W<j{b}) G But, B $2 C, since 
b C. Therefore, #" 7 - 2F b- 

2.118 Let = {D\D c A A (3C)(C G & A BnC C £>)}. 

2.119 Assume that, for every B QA, either B E £F or A — BE#'. Let be a 
filter such that C Let B G ^ — #3 Then A — B E !F . Hence, A — B E&. 
So, 0 = Z?n(/4 — B) G ^ and ^ is improper. The converse follows from Ex- 
ercise 2.118. 

2.120 Assume ZF is an ultrafilter and B ^ C ^ By Exercise 2.119, 

A-BE& and ^ - C G Hence, A - (5uC) = - 5)n04 - C) G 

Since is proper, SuC ^ Conversely, assume B !\C ^ 

=>- 5uC ^ . Since Z?u04 — 5) = ^ G this implies that, if B !F , then 

A — B E #\ Use Exercise 2.119. 

2.121 (a) Assume #c is a principal ultrafilter. Let a EC and assume 

C {«}. Then {c} ^ and C — {a} ^ !F c- By Exercise 2.120, 
c= M u(C — {«}) $ ^ c , which yields a contradiction. 

(b) Assume a non-principal ultrafilter SF contains a finite set, and let B 
be a finite set in !F of least cardinality. Since !F is non-principal, 
the cardinality of B is greater than 1 . Let b E B. Then B — {£>} fi 0. 
Both { b } and B — {£>} are finite sets of lower cardinality than B. 
Hence, {b} $ SF and B — {*} £ <F r . By Exercise 2.120, 
B={b } u (B — {£}) $ !F, which contradicts the definition of B. 
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2.124 Let J be the set of all finite subsets of T. For each A in J, choose a 

model M a of A. For A in let A* = {A' | A' G J A A C A'}. The collection 0 
of all A*s has the finite-intersection property. By Exercise 2.117, there is a 
proper filter 3F D <$. By the ultrafilter theorem, there is an ultrafilter 
#■' D 3F D Consider EIagj Ma/^- Let Then {^?}* G 0 C #/ 

Therefore, {£$}* C {A| A G G #/ By Los’s theorem, 38 is true in 

2.125 (a) Assume if is closed under elementary equivalence and ultra- 

products. Let A be the set of all sentences of 5£ that are true in 
every interpretation in if. Let M be any model of A. We must 
show that M is in if. Let T be the set of all sentences true for M. 
Let J be the set of finite subsets of T. For T 1 = {38 \ , . . . , 38 „} G J, 
choose an interpretation N r > in if such that 38 \ A ... A 38 n is true 
in N r '. (If there were no such interpretation, ->{38 \ A... f\38 n ), 
though false in M, would be in A.) As in Exercise 2.124, there is an 
ultrafilter 3f' such that N* = Ilr'ey Nr f 3F' is a model of T. Now, 
N* G if. Moreover, M = N*. Hence, M G if. 

(b) Use (a) and Exercise 2.59. 

(c) Let if be the class of all fields of characteristic 0. Let 3f be a non- 
principal ultrafilter on the set P of primes, and consider 

M = FUp Z p/&- Apply (b)- 

2.126 R & C R*. Hence, the cardinality of R * is > 2^°. On the other hand, R 01 
is equinumerous with 2 C> and, therefore, has cardinality 2 K °. But the cardi- 
nality of R* is at most that of R (0 . 

2.127 Assume x and y are infinitesimals. Let e be any positive real. Then 

|xj < e/2 and \y\ < e/2. So, |x + y| ^ \x\ + \y\ < e/2 + e/2 = e; \xy\ = \x\\y\ 

< 1 ■ e = e; |x — yj ^ \x\ + | — < e/2 + e/2 = e. 

2.128 Assume |x| < a*, and [y| < e for all positive real e. Let e be a positive 
real. Then e/r\ is a positive real. Hence [y| < e/i\, and so, 
M = M It I < n (e/e, ) = e. 

2.130 Assume x — a*, and x — r 2 are infinitesimals, with r, and r 2 real. Then 
(x — a*,) — (x — r 2 ) = t'2 — V \ is infinitesimal and real. Hence, r 2 — r \ = 0. 

2.131 (a) x — st(x) and y — st( t) are infinitesimals. Hence, their sum 
(x + t) ~~ (st(x) + s t(y)) I s an infinitesimal. Since st(x) + st(.y) is real, 
st(x) + st(y) = st(x + T) by Exercise 2.130. 

2.132 (a) By Proposition 2.45, s/at) « c, and u*(n) ~ c 2 for all n G of — co. 

Hence, f(n) + u*{n) « c, + c 2 for all at G co* — co. But 

5* (at) + m*(at) = {s + i/)*(at). Apply Proposition 2.45. 

2.133 Assume / continuous at c. Take any positive real e. Then there is a 

positive real <5 such that (Vx)(x G B A |x — c| < <5 => I fix ) - f(c ) | < e) holds 
in Therefore, (Vx)(x G B* A |x — c| < S => \ f* (x) — /(c) | < e) holds in 

38*. So, if x G B* and x « c, then \x — c\< 5 and, therefore, 
!/•(*) -/mi < e. Since e was arbitrary, /* (x) « /(c) . Conversely, assume 
x G B* Ax « c => /* (x) « /(c). Take any positive real e. Let <5 q be a positive 
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infinitesimal. Then (Vx) (x G B* A |x — c| < do => | /* (x) — /(c) | < c) holds 
for 3£*. Hence, (3<5)(<5 > 0 A (Vx)(x G B* A |x — cj < 8 =£► |/'(x) — f{c)\ < e)) 
holds for and so, (3<5)(<5 > 0 A (Vx)(x G B A lx — cl < <5 => l/M 

—/(c) | < e)) holds in 0t. 

2.134 (a) Since xGl?*Ax«c=^ (/*(x) « /(c) A g*(x) « <j(c)) by Proposi- 
tion 2.46, we can conclude Ar«c=> (/ + g)*(x) 

« (/ + g)(c), and so, by Proposition 2.46, f -\-g is continuous 
at c. 

2.139 (a) (i) n[(Vx)(^J(x) VA^(x)) => ((Vx)Aj(x)) V (Vx)A^ (x)] 


(ii) (Vx)(<4j (x) V A l 2 (x)) 

0) 

(iii) -l[((Vx)^J(x)) V (Vx)^J(x)] 

(i) 

(iv) 1 (Vx)A J (*) 

(iii) 

(v) 1 (Vx)^(x) 

(iii) 

(vi) (3x)n^l(x) 

(iv) 

(vii) (3x)-l A\{x) 

(v) 

(viii) ~^A\{b) 

(ix) “3 A\ (c) 

(vi) 

(vii) 

(x) A){b) V A\{b) 

(ii) 

S \ 

(xi) A\ (b) A\{b) 

(x) 

(xii) X A\(c) V Afa) 

(ii) 

s \ 

(xiii) A](c) A\(c) 

(xii) 


X 


No farther rules are applicable and there is an unclosed branch. Let the 
model M have domain {6, c}, let (A\) M hold only for c, and let (A*,)™ hold 
for only b. Then, (Vx)(^4[(x) V A ] 2 (x)) is true for M, but (Vx)A\(x) and 
(Vx)^ 2 (x) are both false for M. Hence, (Vx) (A \ (x) V ^ (x) ) =>■ ((Vx)^{(x)) 
V (Vx)^ 2 (x) is not logically valid. 


CHAPTER 3 

3.4 Consider the interpretation that has as its domain the set of polynomials 
with integral coefficients such that the leading coefficient is non-negative. 
The usual operations of addition and multiplication are the interpretations 
of + and •. Verify that (S1)-(S8) hold but that Proposition 3.11 is false 
(substituting the polynomial x for x and 2 for y). 
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3.5 (a) Form a new theory S' by adding to S a new individual constant b 

and the axioms b ^ 0, b ^ I, b ^ 2, . . . , b ^ n, ... . Show that S' is 
consistent, and apply Proposition 2.26 and Corollary 2.34(c). 

(b) By a cortege let us mean any denumerable sequence of Os and Is. 
There are 2 H ° corteges. An element c of a denumerable model M of 
S determines a cortege (5 0 ,5j ,S 2 , ■ ■ •) as follows: 57 = 0 if |=m pi I O 
and Sj = 1 if |=m ~l (aI c )- Consider now any cortege s. Add a new 
constant b to S, together with the axioms £di{b), where i%j(b) is pi\b 
if St — 0 and ^,(6) is {pi\b) if Sj = 1. This theory is consistent and, 
therefore, has a denumerable model M s , in which the interpretation 
of b determines the cortege s. Thus, each of the 2 Ho corteges is 
determined by an element of some denumerable model. Every 
denumerable model determines denumerably many corteges. 
Therefore, if a maximal collection of mutually non-isomorphic 
denumerable models had cardinality m < 2 Ho , then the total num- 
ber of corteges represented in all denumerable models would be 
^ m x < 2 N ° . (We use the fact that the elements of a denumer- 
able model determine the same corteges as the elements of an iso- 
morphic model.) 

3.6 Let ( D , 0, ') be one model of Peano’s postulates, with 0 e D and ' the 
successor operation, and let (£)#, 0#,* ) be another such model. For each x 
in D, by an x-mapping we mean a function / from S* = [u\u 

into D# such that /( 0) = 0# and f(i /) = ( f{u ))* for all u <x. Show by 
induction that, for every x in D, there is a unique x-mapping (which will be 
denoted f x ). It is easy to see that, if x\ < *2, then the restriction of f X2 to S Xl 
must be f Xl . Define F(x) = f x (x) for all x in D. Then F is a function from D 
into D# such that F(0) = and F(x!) — (F(x)) + for all x in D. It is easy to 
prove that F is one-one. (If not, a contradiction results when we consider 
the least x in D for which there is some y in D such that x and 
F(x)=F(y).) To see that F is an isomorphism, it only remains to show that 
the range of F is D#. If not, let z be the least element of D# not in the range 
of F. Clearly, z ^ 0#. Hence, z = w* for some w. Then w is in the range of F, 
and so w = F(u) for some u in D. Therefore, F(i/) = ( F(u ))* = w* = z, 
contradicting the fact that z is not in the range of F. 

The reason why this proof does not work for models of first-order 
number theory S is that the proof uses mathematical induction and the least- 
number principle several times, and these uses involve properties that cannot 
be formulated within the language of S. Since the validity of mathematical 
induction and the least-number principle in models of S is guaranteed to 
hold, by virtue of axiom (S9), only for wfs of S, the categoricity proof is not 
applicable. For example, in a non-standard model for S, the property of 
being the interpretation of one of the standard integers 6, 1,2, 3 , . . . is not 
expressible by a wf of S. If it were, then, by axiom (S9), one could prove that 
{0, T, 2, 3 , . . .} constitutes the whole model. 






3.7 Use a reduction procedure similar to that given for the theory K 2 on 
pages 116-17. For any number k, define kt by induction: 0 * t is 0 and 
(k -F 1) • t is (k • t) + 1; thus, k • t is the sum of t taken k times. Also, for any 
given k , let t = 5 (mod k) stand for (3 x) (t = s + k ■ x V s = t + k • x). In the 
reduction procedure, consider all such wfs t = ^(mod k), as well as the wfs 
t < s, as atomic wfs, although they actually are not. Given any wfs of S + , we 
may assume by Proposition 2.30 that it is in prenex normal form. Describe a 
method that, given a wf (3 y)&, where contains no quantifiers (remem- 
bering the convention that t = .s(mod k) and t < s are considered atomic), 
finds an equivalent wf without quantifiers (again remembering our con- 
vention). For help on details, see Hilbert and Bemays (1934, 1, pp. 359-366). 

3.8 (b) Use part (a) and Proposition 3.6(a)(i). 

(c) Use part (b) and Lemma 1.12. 

3.13 Assume f(x\ , . . . , x„) — x„ +1 is expressible in S by 35{x\ , . . . , x„ +) ). Let 
^{x\ , . . . , x,h_] ) be @{x \ , . . . , x„ +L ) A (Vz)(z < x K+1 => “I @(x \ , . . . , x„ +! )). 
Show that ‘Z represents /(xj , . . . ,x„) in S. [Use Proposition 3.8(b).] Assume, 
conversely, that f{x \ , . . . ,x„) is representable in S by (xj , ... ,x„ + i). Show 
that the same wf expresses f{x \ , . . . , x „ ) = x w+ i in S. 

3.16 (a) (3y) w<J)<lJ R(x 1 ,.. ; ,x /1 ,y) is equivalent to (3 z) 2<1) . {m+1) R(x, , . . . ,x„, 
z + u + 1), and similarly for the other cases. 

3.18 If the relation R(xi, . . . ,x„,y) : f{x \ , . . . ,x„) = y is recursive, then Cr 
is recursive and, therefore, so is /(xj, . . . ,x„) = fj.y(Cn(xi, . . . , x„ , y ) = 0). 
Conversely, if f{x \ , . . . , x„) is recursive, Cr(xi , . . . , x n ,y) = 
Sg[/(x, , . . . , x„) — y\ is recursive. 

3.19 

[V"] - <5(/OVe»n(j' 2 > «)) 

n(») = 


3.20 [ne] = [n(l + 1 + 


+ 3 ! + 


+sr)]» 


since 




(»+ 2 )! 


< — 
^ nV 


Let 1 + 1 T ^ + • • • + Then g(0) = 1 and g(?t + 1) = (/? + l)g(ii) +L 

Hence, g is primitive recursive. Therefore, so is \ne\ — = qt(n!, ng(n)). 

3.21 RP(y,z) stands for (Vx) jr ^^ v+Z (x|y Ax|z => x = 1). 


<p{n) = ^ & s(CKp(y,n)) 

3.22 Z(0) = 0,Z(y + 1) = U\ ( y , Z( y ) ) . 

3.23 Let v = (popi . . .pk) + 1. Some prime q is a divisor of v. Hence, q^v. 
But q is different from po,p\ , . . . ,Pk- If q = pj, then pj\v and pj\pop\ . . -Pk 
would imply that pj\ 1 and, therefore, pj — 1 . Thus, pk+i^q^ 
(popi ■••Pk) + 1 . 
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3.26 If Goldbach’s conjecture is true, h is the constant function 2. If Gold- 
bach’s conjecture is false, h is the constant function 1. In either case, h is 
primitive recursive. 

3.28 List the recursive functions step by step in the following way. In the 
first step, start with the finite list consisting of Z(x), N(x), and U\ (x) . At the 
(n + l)th step, make one application of substitution, recursion and the /.i- 
operator to all appropriate sequences of functions already in the list after the 
nth step, and then add the n + 1 functions £/" +1 (x|, . . . ,x n+ i) to the list. 
Every recursive function eventually appears in the list. 

3.29 Assume f x (y) is primitive recursive (or recursive). Then so is 
f x (x) + 1. Hence, f x (x) + 1 is equal to fk{x) for some k. Therefore, 
f k (x) — f x (x) 4- 1 for all x and, in particular, fk(k) = fk(k) + 1. 

3.30 (a) Let d be the least positive integer in the set Y of integers of the 
form au + bv, where u and v are arbitrary integers - say, d = quq + bv 0 . Then 
d\a and d\b. (To see this for a , let a = qd + r, where 0 ^ r < d. Then 
r = a — qd = a — q{auQ + bv o) = (1 — quo)a + (—qvo)b G Y. Since d is the 
least positive integer in Y and r < d, r must be 0. Hence d\a.) If a and b are 
relatively prime, then d — 1. Hence, 1 = au 0 + bv 0 . Therefore, aito = 1 
(mod b). 

3.32 (a) 1944 = 2 3 3 5 . Hence, 1944 is the Godel number of the expression 

()• 

(b) 49 = 1 + 8(2’ 3 1 ). Hence, 49 is the Godel number of the function 
letter /,* . 

3.34 (a) g(fl) = 49 and g(a i) = 15. So, gtfM) = 2 49 3 3 5 15 7 5 . 

3.37 Take as a normal model for RR, but not for S, the set of polynomials 
with integral coefficients such that the leading coefficient is non-negative. 
Note that (Vx)(3y)(x = y -\-y V x = y + y + 1) is false in this model but is 
provable in S. 

3.38 Let oo be an object that is not a natural number. Let oo' = oo, oo 
+jc = x + oo = oo for all natural numbers x, oo • 0 = 0 • oo = 0, and 
oo • x = x • oo = co for all x / 0. 

3.41 Assume S is consistent. By Proposition 3.37(a), is not provable in S. 
Hence, by Lemma 2.12, the theory S g is consistent. Now, ~YS is equivalent 
to ( 3 * 2 )^/ (*2, r ^ n )- Since there is no proof of ^ in S, ^/{k,q) is false for 
all natural numbers k, where q = r ^ n . Hence, h s l Pf(A, q) for all natural 
numbers k. Therefore, h Sg “l^/(&, g). But, f= Sg (3x2)^/{x2,q). Thus S g is o>- 
inconsistent. 

3.45 (G. Kreisel, Mathematical Reviews , 1955, Vol. 16, p. 103) Let &(xj) 

be a wf of S that is the arithmetization of the following: x\ is the Godel 
number of a closed wf PA such that the theory S + {PA} is ^-inconsistent. 
(The latter says that there is a wf ${x) such that, for every n, S(h) is provable 
in S + {^}, and such that (3x)-i#(x) is provable in S + {^}.) By the fixed- 
point theorem, let ^ be a closed wf such that h s W 1 ). Let 

K — S + {^}. (1) ^ is false in the standard model. (Assume ^ true. Then K 



0 
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is a true theory. But, is true, since it is provable in S. So, 

${ r n ) is true. Hence, K is co- inconsistent and, therefore, K is not true, 
which yields a contradiction.) (2) K is co-consistent. (Assume K co-incon- 
sistent. Then .^( r is true and, therefore, ^ is true, contradicting (1).) 
3.46 (a) Assume the ‘function’ form of Church’s thesis and let A be an 

effectively decidable set of natural numbers. Then the character- 
istic function Ca is effectively computable and, therefore, recur- 
sive. Hence, by definition, A is a recursive set. 

(b) Assume the ‘set’ form of Church’s thesis and let /(xj, . . . ,x„) be 
any effectively computable function. Then the relation 
/(xj , . . . ,x„) = y is effectively decidable. Using the functions o k , af 
of pages 183-4 let A be the set of all z such that 
/(a 7 / 1 1 (z), . . . , <t" +, (z)) = <t”+{ (z). Then A is an effectively decid- 
able set and, therefore, recursive. Hence, /(x i,...,x„) 
= <t"+J (j.iz(Ca(z) = 0)) is recursive. 

3.48 Let K be the extension of S that has as proper axioms all wfs that are 
true in the standard model. If Tr were recursive, then, by Proposition 3.38, K 
would have an undecidable sentence, which is impossible. 

3.49 Use Corollary 3.39. 

3.50 Let /(xj,...,x„) be a recursive function. So, f(x \ , . . . ,x„) = y is a 
recursive relation, expressible in K by a wf stf(x \, . . . ,x n ,y). Then / is rep- 
resentable by srf{x\ , . . . , x n , y) A (Vz) (z < y =y ~ l stf (xj , . . . , x n , z)) , where 
z < y stands for z^y Az^y. 

3.53 (a) b 0 - 1=^. Hence, b ^?^( r 0 = 0 n ) and, 

therefore, I- —\&et#( r 0 — l n ). Thus, I- <£=> 

-]^W( r 0 = T n ). 

(b) b Also, 

( r 0 ] ), and SO, h @eev{ r ~ l^ n ) <==> n ). 

Hence b^3W?( r 0 n ) By a tautology, 

b 0=^(-\0=>(^A “]#)); hence, b^^( r 0 n )=> <2W( r ~YS 
=>(0 A ~I^) n ). Therefore, b @ew { r ^ ) => (^.^( r n^ n ) 
=>■ A ~1&) 1 )). It follows that b &et£>(y ^ "* ) ==> 

A But, h^Al^=>0 = l; so, \- &*u> 

( r (0 A n <gj n )=>@eu >{' r 0 = l n ). Thus, b 40W( r «P)=> 

= 1^), and b — = 1 ) =r* — I ^ ^ ~* ). 

Hence, b l<^( r 0 = I n )=^. 

3.56 If a theory K is recursively decidable, the set of Godel numbers of 
theorems of K is recursive. Taking the theorems of K as axioms, we obtain a 
recursive axiomatization. 

3.58 Assume there is a recursive set C such that 7k Q C and Ref^ C C. Let 
C be expressible in K by s/(x). Let with Godel number k, be a fixed point 
for -i^/(x). Then, b& “l . Since s${x) expresses C in K, b K stf{k) 

or bx —[sf(k). 
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(a) If Ek stf(k), then Ek . Therefore, k eRefit C C. Hence, 
Ek —\-^(k), contradicting the consistency of K. 

(b) If E k ~\s#{k), then E K So, k E T&E C and therefore, E K sY(k), 
contradicting the consistency of K. 

3.60 Let K 2 be the theory whose axioms are those wfs of Kj that are 
provable in K* . The theorems of K 2 are the axioms of K 2 . Hence, x e 7k 2 if 
and only if Fml Kl (x) A x € 7k* . So, if K* were recursively decidable — that is, 
if 7k* were recursive - 7k 2 would be recursive. Since K 2 is a consistent 
extension of Kj , this would contradict the essential recursive undecidability 

of Kj. 

3.61 (a) Compare the proof of Proposition 2.28. 

(b) By part (a), K* is consistent. Hence, by Exercise 3.60, K* is es- 
sentially recursively undecidable. So, by (a), K is recursively un- 
decidable. 

3.62 (b) Take (Vx)(i4j(x)<*==^x = x) as a possible definition of Aj. 

3.63 Use Exercises 3.61(b) and 3.62. 

3.64 Use Corollary 3.46, Exercise 3.63, and Proposition 3.47. 

CHAPTER 4 

4.12 (s) Assume u Gx x y. Then u = (v, w) = {{u}, (a, w}} for some v in x 

and w in y. Then v E XKjy and w E xuy. So, {i>} E 0*(xuy) and 
{v, w} E xuy ). Hence, {{z;}, {v, w}} E ^(^(xuy)). 

4.15 (a) 3>{x) C U(L)x) and ffl(x) C IJ(IJx). Apply Corollary 4.6(b). 

(b) Use Exercise 4.12(s), Exercise 4.13(b), axiom W, and Corollary 
4.6(b). 

(c) If Rel(T), then YE £)(Y) x M(Y). Use part (b) and Corollary 
4.6(b). 

4.18 Let X — {(yi,y2)bi ~yi Ayj E F}; that is, X is the class of all or- 
dered pairs (u, u) with u E Y. Clearly, Fnc(X) and, for any set x, 
(3i;)((t>, u) eX f\v E x)^=>u E Ynx. So, by axiom R, M(Ynx) 

4.19 Assume Fnc(F). Then Fnc(xfF) and ^(xfF)Cx. By axiom R, 
M (F“x). 

4.22 (a) Let 0 be the class \u\u ^ u}. Assume M(A). Then 0 C X. So, 0 = 

0 n X. By axiom S, M (0). 

4.23 Assume M(F)- Let F = {x|x^x}. It was proved above that -<M(F). 
But F C V. Hence, by Corollary 4.6(b), ->M(F). 

4.30 (c) Let w be the least e-element of X — Z. 

4.33 (a) By Proposition 4.11(a), Trans(co). By Proposition 4.11(b) and 

Proposition 4.8(j), co e On. If co e Kj then cj E co, contradicting 
Proposition 4.8(a). Hence, 

4.39 Let Xj = X x {0} and F, = F x {1}. 
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4.40 For any u C y, let the characteristic function C„ be the function with 
domain y such that C u l w = 0 if w G u and C u ‘ \v = 1 if w E y — u. Let F be the 
function with domain &(y) such that F'u = C u for u € y ) - Then ^(x)=2 y 

F 

4.41 (a) For any set u, c s(a) is a set by Exercise 4. 1 5(a). 

(b) Ifu G x y , then u C y x x. So, x y C 0>{y x x). 

4.42 (a) 0 is the only function with domain 0. 

(c) If 3){u) / 0, then M(u) / 0. 

4.43 Define a function F with domain X such that, for any xo in X, F(x 0 ) is 
the function g in X^ Uy such that g l u = xo- Then X ~ X^ . 

4.44 Assume X = Y and Z~ W. If ->M( IF), then -nM(Z) and X z = Y w = 0 

F G 

by Exercise 4.41(a). Hence, we may assume M(W) and M(Z). Define a 
function <I> on X z as follows: if / G X z , let O 4 / = F o / o G ~ l . Then X Z =Y W . 

4.45 If X or Y is not a set, then Z XuY and Z A x Z Y are both 0. We°may 
assume then that X and Y are sets. Define a function <D with domain Z XuY as 
follows: if / g Z XvjY , let <D ‘/ = {X f /, Y f /). Then Z Xv)Y £* Z x x Z Y . 

4.46 Define a function F with domain {x y ) z as follows: for any / in (x ) ’) z , 
let F‘/ be the function in x yxx such that (F‘fy(u,v) = (f i v) l u for all 
(u, v } G y x z. Then (x y ) z = x yxz . 

F 

4.47 If -.M(Z), (X x Y) z = 0 = 0 x 0 = X z x Y z . Assume then that 
M(Z). Define a function F: X z x Y z — > (X x Y) z as follows: for any 
/ eX z ,g e Y z ,(F‘(f,g))‘z = (f%g l z) for all z in Z. Then 
X z xY z ^(X x T) . 

4.48 This is a direct consequence of Proposition 4.19. 

4.54 (b) Use Bernstein’s theorem (Proposition 4.23(d)). 

(c) Use Proposition 4.23(c,d). 

4.55 Define a function F from V into 2 C as follows: F l u = {u, 0} if 
u / 0; F‘0 = {1,2}. Since, F is one-one, F =^2 C . Hence, by Exercises 4.23 and 
4.50, -iM(2 c ). 

4.56 (h) Use Exercise 4.45, 

(i) 2 X ^2 X + c x^2 A + c 2 X = 2 X x 2 ^ 2 X x 2 1 ^ 2 x+cl s* 2 X . 

Hence, by Bernstein’s Theorem, 2 X + c x = 2*. 

4.59 Under the assumption of the axiom of infinity, oj is a set such that 
(3«)(« G oj) A (Vy)(y G oj => (3z)(z G oj Ay C z)). Conversely, assume (*) 
and let b be a set such that (i) (3w)(« G b) and (ii) (Vy ) ( y G 
=> (3z)(z G b Ay C z)). Let J = (w|(3z)(z G b A w C z)}. Since d (U(^))> 
d is a set. Define a relation F = {(«, a)[« G oj A r = {n|w G J A w = «}}. Thus, 
(/ 2 , u) g F is and only if n G oj and i; consists of all elements of d that are 
equinumerous with n. R is a one-one function with domain co and range a 
subset of 0(d). Hence, by the replacement axiom applied to F -1 , cu is a set 
and, therefore, axiom I holds. 

4.62 (a) Induction on a in (Vx)(x = a A a G a ) => ¥in(fZ(x))). 
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(b) Induction on a in (Vx) (x = a A a G co A (Vj>) (v Fin(y)) 

FinflJ*)). 

(c) Use Proposition 4.27(a). 

(d) x C ^((Jx) and y Ex => y C |Jx. 

(e) Induction on a in (Vx)(x = aAfl£€(t)4 (x=^y V y=^x)) 

(g) Induction on a in (Vx)(x = a A a G co A Inf(F) => x=^T) 

(h) Use Proposition 4.26(c). 

(j) X- 3 ’ c ^( J X x) 

4.63 Let Z be a set such that every non-empty set of subsets of Z has a 
minimal element. Assume Inf(Z). Let Y be the set of all infinite subsets of Z. 
Then Y is a non-empty set of subsets of Z without a minimal element. 
Conversely, prove by induction that, for all a in co, any non-empty subset of 
^(a) has a minimal element. The result then carries over to non-empty 
subsets of Y?{z), where z is any finite set. 

4.64 (a) Induction on a in (Vx)(x = a A a G co A Den(y) => Den(xuy)). 

(b) Induction on a in (Vx)(x ~ a Ax ^ 0 A Den(y) => Den(x x y)) 

(c) Assume z C x and Den(z). Let z~ co. Define a function g on x as 

j' w j 

follows: g l n — u if u Ex — z; g'u — (fY{(f l u) ) if u E z. Assume x 
is Dedekind-infinite. Assume z Ex and x~z. Let v Ex — z. Define 
a function h on co such that h% = v and if a G co. 

Then h is one-one. So, Den(/7 u co) and /?“co C x. 

(f ) Assume y £ x. (i) Assume x u {_>'} = x. Define by induction a 
function g on co such that g% = y and g‘(n + l) = f < ‘{g l n). g is a 
one-one function from co into x. Hence, x contains a denumerable 
subset and, by part (c), x is Dedekind-infinite, (ii) Assume x is 
Dedekind-infinite. Then, by part (c), there is a denumerable subset 
z of x. Assume z=co. Let co^ (/ _1 )‘0- Define a function F as 
follows: F l u = u fo / u E x - z; F‘c 0 =y, F‘w= (/" 1 )‘((/‘w - 1) for 
u E z — {c 0 }. Then x ~ xujy}. If z is {c 0 , c \ , C 2 , . . .}, F takes q +( 

F 

into Ci and moves co into y. 

(g) Assume co=^x. By part (c), x is Dedekind-infinite. Choose y ^ x. 
By part (f), x=xu{y}. Hence, x + c 1 = (x x {0})u mm 

= X U {)'} = X. 

4.65 Assume M is a model of NBG with denumerable domain D. Let z be 
the element of D satisfying the wf x = 2 0J . Hence, z satisfies the wf ->(x = co). 
This means that there is no object in D that satisfies the condition of being a 
one-one correspondence between z and co. Since D is denumerable, there is a 
one-one correspondence between the set of ‘elements’ of z (that is, the set of 
objects v in D such that |=m v E z) and the set of natural numbers. However, 
no such one-one correspondence exists within M. 

4.68 NBG is finitely axiomatizable and has only the binary predicate letter 
A\. The argument on p. 269-70 shows that NBG is recursively undecidable. 
Hence, by Proposition 3.49, the predicate calculus with A\ as its only non- 
logical constant is recursively undecidable. 
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4.69 (a) Assume x=^<u H . If 2=^x, then, by Propositions 4.37(b) and 4.40, 

(o a ^xu x oj a ^ co a x oj a = co a . If x contains one element, 

use Exercise 4.64(c,f). 

(b) Use Corollary 4.41. 

4.70 (a) &(oj a ) x 0>{coa) = 2 (0a x 2 C0 * ^ 2 C0a+ ° (0 * ^ 2"“ s 
(b) {@{oj a )Y s (2 C) *Y = 2 CU ° X ' V £* 2 0J * s ^(co K ) 

4.71 (a) If y were non-empty and finite, y = y + c y would contradict Ex- 

ercise 4.62(b). 

(b) By part (c), let y — u u v, u rw = 0, u ~ y, v ^ y. Let y = y. Define a 
function 0 on ^(y) as follows: for x C y, let g l x = uUif^x). Then 
g l x C y and y = u=^g‘x=^y. Hence, g‘x^y. So, gr is a one-one 
function from ^(y) into yl = (z|z CyAz = y}. Thus, £^(y)=^4. 
Since A C <^(y), zl^(y). 

(e) Use part (d): (z|z CyAz = y} C (z|z C y A Inf(z)}. 

(f) By part (c), let y = uuv, unv = 0, u = y, v = y. Let w = v. Define / 

h 

on y as follows: /‘x = h l x if x £ u and f l x = (// _1 )‘x if x £ i>. 

4.72 (a) Use Proposition 4.37(b). 

(b) (i) Perm (y) C y->'^(2>') J ' = 2) ,yy S* 2 ? S* 0>{y). 

(ii) By part (a), we may use Exercise 4.7 (c). Let 

v = MUD,i/ni; = 0 ) M = v,D=y. Let u = v and v = u. Define a 

. . H . G 

function F: 0*(y) — » Perm (y) in the following way: assume 
z£^(y). Let ^ 2 :y— >y be defined as follows: if/^x 
= // c x if x £ G“z; ij/ z l x = (//’ )‘x if ( II ~ 1 ) ‘x £ G“z; t/Cx =x 
otherwise. Then i}/ z £ Perm(y). Let F l z = \\t z . F is one-one. 
Hence, &(y) ^Penn(y). 

4.73 (a) Use WO and Proposition 4.19. 

(b) The proof of Zorn => WO in Proposition 4.42 uses only this 
special case of Zorn’s Lemma. 

(c) To prove the Hausdorff maximal principal (HMP) from Zorn, 
consider some C-chain Co in x. Let y be the set of all c-chains C in 
x such that Co C C and apply part (b) to y. Conversely, assume 
HMP. To prove part (b), assume that the union of each non- 
empty C-chain in a given non-empty set x is also in x. By HMP 
applied to the C-chain 0, there is some maximal C-chain C in x. 
Then (J(C) * s an C -maximal element of x. 

(d) Assume the Teichmuller— Tukey lemma (TT). To prove part (b), 
assume that the union of each non-empty C-chain in a given non- 
empty set x is also in x. Let y be the set of all C-chains in x. y is 
easily seen to be a set of finite character. Therefore, y contains a C- 
maximal element C. Then |J(C) is a C-maximal element of x. 
Conversely, let x be any set of finite character. In order to prove 
TT by means of part (b), we must show that, if C is a C-chain in x, 
then U(C) € x. By the finite character of x, it suffices to show that 
every finite subset z of (J(C) is in x. Now, since z is finite, z is a 
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subset of the union of a finite subset W of C. Since C is a C-chain, 
W has a c -greatest element w G x, and z is a subset of w. Since x is 
of finite character, z € x. 

(e) Assume Rel(x). Let u — {z|(Eb)(a G @(x) Az = {t>}[ x}; that is, 
z G u if z is the set of all ordered pairs ( v , w) in x, for some fixed v. 
Apply the multiplicative axiom to u. The resulting choice set y C x 
is a function with domain @(x). Conversely, the given property 
easily yields the multiplicative axiom. If x is a set of disjoint non- 
empty sets, let r be the set of all ordered pairs (u, v } such that u G x 
and v G u. By part (e), there is a function / C r such that 

= @(r) = x. The range &(f) is the required choice set for x. 

(f ) By trichotomy, either x -< y or y -< x. If x -< y, there is a function 
with domain y and range x. (Assume x =yi C y. Take c G x. Define 

g l u — c if uEy — yi, and g ( u = (/ _, )‘w if w Gj/|.) Similarly, if 
y -< x, there is a function with domain x and range y. Conversely, 
to prove WO, apply the assumption (f) to x and J^‘(P^(x)). Note 
that, if (3f)(f : it —> v A &{f) = v), then ??{y) -< 0(h). Therefore, 
if there were a function / from x onto J^ l (^(x)), we would have 
^‘(^(x)) ~< P?('2f‘(PP(x))) -< P?(x) contradicting the definition of 
Jf‘(£?(x)). Hence, there is a function from J4? c (0(x)) onto x. Since 
x )) is an ordinal, one can define a one one function from x 
into Jf‘(£?(x)). Thus x -< J l f?‘(0 > (x)) and, therefore, x can be well- 
ordered. 

4.76 If < is a partial ordering of x, use Zorn’s lemma to obtain a maximal 
partial ordering <* of x with < C <*. But a maximal partial ordering must 
be a total ordering. (If u, v were distinct elements of x. unrelated by <*, we 
could add to <* all pairs (wi,i>i) such that W|<*w and v^*v\. The new 
relation would be a partial ordering properly containing <*.) 

4.79 (b) Since x x y = x + c y, x x y = aKjb with anb = </>, a ~ x, b ££ y. 

Let r be a well-ordering of y. (i) Assume there exists u in x such 
that (w, v) G a for all v in y. Then y^a. Since a = x, y<Cx, con- 
tradicting ^(y^x). Hence, (ii) for any u in x, there exists v in y 
such that ( u , v) G b. Define /: x — ► such that f l u == (w, v ), where v 
is the r-least element of y such that (w, v) G b. Since / is one-one, 
x^b £* y. 

(c) Clearly Inf(z) and Inf(x + c z). Then 

x T c z = (x + c z) 2 =r+ c 2x (ixz)+ c z 2 =x+ c 2x (xx z) + c z 

Therefore, x x z=^2 x (x x z)=^x + c 2 x (x x z) + c z = x + c z. Con- 
versely, x+ c z=^x x z by Proposition 4.37(b). 

(d) If AC holds, (Vy)(Inf(y) =^y = yxy) follows from Proposition 
4.40 and Exercise 4.73(a). Conversely, if we assume y~yxy for 
all infinite y, then, by parts (c) and (b), it follows that x=^ Jf‘x for 
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any infinite set x. Since ^x is an ordinal, x can be well-ordered. 
Thus, WO holds. 

4.81 (a) Let ( be a well-ordering of the range of r. Let f% be the (-least 

element of ^(r), and let /‘« be the (-least element of those v in 
$(r) such that {f l n, v) G r. 

(b) Assume Den(x) A (Vw)(w € x =>- u ^ 0). Let co = x. Let r be the set 

of all pairs (a, b ) such that a and b are finite sequences (t>o, V\ , . . . v„) 
and («d, i;],..., t^+i ) such that, for + l,Vj G g l i. Since 

M(r) C ££>(/'), PDC produces a function h: co — ► @(r) such that 
(h l n, h l (n')) G r for all n in co. Define the choice function / by taking, 
for each u in x, f‘u to be the (g l u)th component of the sequence 
h‘{g‘u). 

(c) Assume PDC and Inf(x). Let r consist of all ordered pairs 
(w, «u{fl}), where wu{c} C x, Fin(Mu{n}), and a^u. By PDC, 
there is a function f:co—> &(r) such that(/‘i7,/ < (n')) € r for all n 
in co. Define g: co —*■ x by setting g‘n equal to the unique element of 

Then g is one-one, and so, co^x. 

(d) In the proof of Proposition 4.44(b), instead of using the choice 
function h, apply PDC to obtain the function /. As the relation r, 
use the set of all pairs (m, v ) such that u G c, v G c, v G unX. 

4.82 (a) Use transfinite induction. 

(d) Use induction on /?. 

(e) -(f) Use transfinite induction and part (a). 

(h) Assume u C H. Let v be the set of ranks p l x of elements x in u. Let 
fi = Uv. Then u C ¥'/?. Hence u g p ) = *F‘(/?) C H .. 

4.83 Assume X ± 0 A -i(3y)(y G X f\ycxX = 0). Choose hGX Define a 
function g: g l Q = unX, g l (n') = U(g l n)nX. Let x = U(0t(g)). Then 
x ^ 0 and (Vy)^ € x =>■ ynx ^ 0). 

4.88 Hint: Assume that the other axioms of NBG are consistent and that 
the Axiom of Infinity is provable from them. Show that H (0 is a model 
for the other axioms but not for the Axiom of Infinity. 

4.89 Use H i0 +o 0} 

4.95 (a) Let C {x| -i(3y)(x G y Ay G x)}. 

CHAPTER 5 

5.1 q 0 |Bq 0 
q 0 B Rq, 

Qi l|qo 

qiBRq2 

5.2 (a) U\ (b) <5(x) 

5.7 Let a Turing machine ^ compute the function /. Replace all occur- 
rences of q 0 in the quadruples of J* by a new internal state q r . Then add the 


ANSWERS TO SELECTED EXERCISES 


407 


quadruples q 0 <3/«;q r for all symbols a, of the alphabet of ZZ. The Turing 
machine defined by the enlarged set of quadruples also computes the 
function /. 

5.10 (a) N(x ) = x + 1 (c) 2x 

5.12 (a) 


,▼ 1 l 

&(Kj ) 2 1 hq I — ► Sfranr — 01 


0 


0 


j5?K 3 C 


5.14 (a) The empty function (b) N(x) = x+ 1 (c) Z(x) 

5.16 If f(a\) = bi , . . . ,f(a„) = b„, then 

f(x) — gy[(x = a\ Ay = b\) V ■ • • V {x = a„ A y — b n )} 

5.20 Let g(z,x) = U(gyT\ (z,x,y)) and use Corollary 5.11. Let Vq be a 
number such that g(x,x) + 1 = g(vo,x). Then, if g(vo, vq) is defined, 
g(v o, ro) + 1 = g(vo, L’o), which is impossible. 

5.21 g(x i ) • • • j h\ (^i i • • • ’ sg(C/j| (-^■l > • • • ) T • • • 

h k (xi , . . . ,x„) • sg{C Rk (x\ , . . . ,x„)) 

5.22 (a) Assume that h(x) is a recursive function such that h(x) = 

gyTi(x,x,y) for every x in the domain of gyT\ (x, x,y). Then 
(3y) 7) (x, x, y) if and only if 7 \ (jc, jc, / r ( jc) ) . Since T\(x,x,h(x)) is a 
recursive relation, this contradicts Corollary 5.13(a). 

(b) Use Exercise 5.21. 

(c) Z(fj,yT\(x,x,y)) is recursively computable, but its domain is 
{x|(3y)7i (x,x,y)}, which, by Corollary 5.13(a), is not recursive. 

5.29 Let ZZ be a Turing machine with a recursively unsolvable halting 
problem. Let a* be a symbol not in the alphabet of ZZ . Let q, be an internal 
state symbol that does not occur in the quadruples of ZZ . For each q ( of ZZ 
and a j of ZZ, if no quadruple of ZZ begins with q,a 7 , then add the quadruple 
q / a / a^q r . Call the new Turing machine ZZ* . Then, for any initial tape de- 
scription a of , ZZ* , begun on a, prints a k if and only if ZZ is applicable to 
a. Hence, if the printing problem for ,T* and a k were recursively solvable, 
then the halting problem for ZZ would be recursively solvable. 

5.31 Let ZZ be a Turing machine with a recursively unsolvable halting 
problem. For any initial tape description a for ZZ , construct a Turing ma- 
chine ZZ a that does the following: for any initial tape description /?, start ZZ 
on a; if ZZ stops, erase the result and then start ZZ on /?. It is easy to check 
that ZZ is applicable to a if and only if ZZ a has a recursively unsolvable 
halting problem. It is very tedious to show how to construct ZZ a and to 
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prove that the Godel number of a is a recursive function of the Godel 
number of a. 

5.33 Let Do be the index of a partial recursive function G(x) with non-empty 
domain. If the given decision problem were recursively solvable, so would be 
the decision problem of Example l on page 332 

5.34 By Corollary 5.16, there is a recursive function g(u) such that 
<pL/)( x ) = x - gyT\ (u, u,y). Then has an empty domain if and only if 
-i(3y)7i(w, u,y). But, -i(E3y)7i (w, u,y) is not recursive by Corollary 5.13(a). 
5.39 (a) By Corollary 5.16, there is a recursive function g{u) such that 

<y>J( t ,)(x) ~ = u ~ *)■ T he domain of is {«}. Apply 

the fixed-point theorem to g. 

(b) There is a recursive function g{u) such that <pl( u ) (x) = 
gy( x ^ u Ay = 0). Apply the fixed-point theorem to g. 

5.42 (a) Let A — (x| / (x) e B }. By Proposition 5.21(c), B is the domain of a 

partial recursive function g. Then A is the domain of the compo- 
sition go f. Since g o f is partial recursive by substitution, A is r.e. 
by Proposition 5.21(c). 

(b) Let B be a recursive set and let D be the inverse image of B under a 
recursive function /. Then x E D if and only if C B (f(x)) = 0 and 
Cb(/(x)) = 0 is a recursive relation. 

(c) Let B be an r.e. set and let A be the image {/(jt)|x <G B} under a 
partial recursive function /. If B is empty, so is A* If B is non- 
empty, then B is the range of a recursive function g. Then A is the 
range of the partial recursive function f(g(x)) and, by Proposition 
5.21(b), A is r.e. 

(d) Consider part (b). Given any natural number x, compute the value 
f(x) and determine whether f(x) is in B. This is an effective pro- 
cedure for determining membership in the inverse image of B. 
Hence, by Church’s thesis, B is recursive. 

(e) Any non-empty r.e. set that is not recursive (such as that of 
Proposition 5.21(e)) is the range of a recursive function g and is, 
therefore, the image of the recursive set to of all natural numbers 
under the function g. 

5.43 The proof has two parts: 

1 . Let A be an infinite recursive set. Then A is the range of a recursive 

function /, by Proposition 5.21(d). Since A is infinite, 
h(u) = fiy{f{y) > u) is recursive. Let oq be the least element of A. 
Define g(0) = <z 0 > + 1) = f(h(g(n)). Then g is a strictly in- 

creasing function with range A. 

2. Let A be the range of a strictly increasing recursive function g. 
Then g(x) > x for all x (by the special case of Proposition 4.15). 
Hence, x e A if and only if(3i/) w ^ x g( u) = x. So, A is recursive by 
Proposition 3.18. 
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5.44 Assume A is an infinite r.e. set. Let A be the range of the recursive 
function g(x). Define the function / by the following course-of- values re- 
cursion: 

/(«) = g(w({V z )z<„g(y) ^/( z ))) = g(M(^ z ) z <„g(y) / (/"#(«))*)) 

Then A is the range of h, h is one-one, and h is recursive by Propositions 
3.18 and 3.20. Intuitively, /(0) = g(0) and, for n > 0,f(n) — g{y), where y is 
the least number for which g(y) is different from /(0),/(l), ...,/(« — 1). 

5.45 Let A be an infinite r.e. set, and let A be the range of the recursive 
function g. Since A is infinite, F(u) = gy{g(y) > u) is a recursive function. 
Define G(0) = g(0),G(n + 1) = g(gy(g(y) > G(n))) = g(F(G(n))). G is a 
strictly increasing recursive function whose range is infinite and included in 
A. By Exercise 5.43, the range of G is an infinite recursive subset of A. 

5.46 (a) By Corollary 5.16, there is a recursive function g{u,v) such that 

<P ] g{iliD \ (*) = ny(Ti(u,x,y) V T, (u,x,y)). 

5.47 Assume (V). Let f(x\, ... ,x„) be effectively computable. Then the set 
B = {«!/((«) j, . . . , (w)„) = (m) h , j} is effectively enumerable and, therefore, 
by (V), r.e. Hence, u E B<=^(3y)R(u,y) for some recursive relation R. Then 

f(xi,...,x„) = (M((^)o)l =*1 A - - - A ((»)©)* = *» A/?((y)o,(u)i))]o), l+ i 

So, / is recursive. Conversely, assume Church’s thesis and let W be an 
effectively enumerable set. If W is empty, then W. is r.e. If W is non-empty, 
let W be the range of the effectively computable function g. By Church’s 
thesis, g is recursive. But, XE W<=^(3u)(g(u) — x). Hence, W is r.e. by 
Proposition 5.21(a). 

5.48 Assume A is r.e. Since A / 0, A is the range of a recursive function g(z). 
So, for each z, U{jiyT\ (g(z),x,y)) is total and, therefore, recursive. Hence, 
U(f.iyT\ (g(x),x,y)) + 1 is recursive. Then there must be a number zq such 
that U(fiyT\(g(x),x,y)) + 1 = U(/.iyT\(g(zo),x,y)). A contradiction results 
when x = zq. 

5.49 (a) Let (p(n) — n for all n. 

5.50 Let <p(z) = o\(gy[T\ (z, <7j(y), o^iy)) A oj(y) > 2z]), and let B be the 
range of (j). 

5.55 (b) Let A be r.e. Then x E where R is recursive. Let 

&(x,y) express R(x,y) in K. Then k £ A<^=> Ek (3y)3%(k,y). 

(c) Assume k E A^=^ E K -v/(k) for all natural numbers k. Then k E A 
<s=^(3 y)B.</(k,y) and B P j is recursive (see the proof of Proposition 
3.29 on page 199. 

5.56 (a) Clearly 7k is infinite. Let f(x) be a recursive function with range 

7k- Let #0,^1,... be the theorems of K, where 66 j is the wf of K 
with Godel number f(j). Let g{x,y) be the recursive function such 
that, if x is the Godel number of a wf then g{x,j) is the Godel 
number of the conjunction # A ^ A • • • A ^ consisting of j con- 
juncts; and, otherwise, g{xj) = 0. Then g{f(j),j) is the Godel 
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number of the y'-fold conjunction 88 j A 88 j A ■ • ■ A 88 j. Let K' be the 
theory whose axioms are all these j-fold conjunctions, for 
j — 0, 1, 2, . . . Then K' and K have the same theorems. Moreover, 
the set of axioms of K' is recursive. In fact, x is the Godel number 
of an axiom of K' if and only if x / 0 A (3y)^ v (g(/(y),y) — jc). 
From an intuitive standpoint using Church’s thesis, we observe 
that, given any wf one can decide whether stf is a conjunction 
<€ t\<€ t\ ■ ~ N<€\ if it is such a conjunction, one can determine the 
number j of conjuncts and check whether ^ is 88 j. 

(b) Part (b) follows from part (a). 

5.58 (a) Assume 88{x\) weakly expresses (7k)* in K. Then, for any 

n, F K 88 {n) if and only if/? e (7’k)*. Let p be the Godel number of 
88(x\). Then Fk &(p) if and only if p € (7’k)*- Hence, Fk 88{p) if 
and only if the Godel number of 88{p) is in 7’k; that is, F K 88{p) if 
and only if not-FK &(p)- 

(b) If Kis recursively decidable, 7k is recursive. Hence, 7’k is recursive 
and, by Exercise 5. 57, (7’k)* is recursive. So, (7’k)* is weakly ex- 
pressible in K, contradicting part (a). 

(c) Use part (b); every recursive set is expressible, and, therefore, 
weakly expressible, in every consistent extension of K. 

5.59 (a) (i) <5(x). 

(ii) xi —X2 

(iii) The function with empty domain. 

(iv) The doubling function. 

(b) (i) /f(xi,0)=*i 

/?(0,x 2 ) -x 2 

f\(( x i / » fa )')=f\{ x 1 ^ 2 ) 

(ii) i,0)= xi 

f\{x 1, fo)') = {f\(x \,X2))' 

ffc i»0) = 0 

(iii) fl (0) = I 

= 0 
fi (0) - 0 

5.61 (a) Any word P is transformed into QP. 

(b) Any word P in A is transformed into PQ. 

(c) Any word P in A is transformed into Q. 

(d) Any word P in A is transformed into /?, where n is the number of 

symbols in P. 



5.62 (a) a£ 
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•A in A) 

• A 
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(b) a£ 

fa 


fa (f in A) 
•A (f in A) 
• A 


(c) £ — > A (f in A) 


(d) 


af 

P 


nPZ (f > n in A) 

f/?fa (f in A) 


■ A 


5.63 


Q,a (/ = 1 , . . . , A:) 
fa (finA-{m, 
• A 


. «*}) 


5.64 (d) |B| 

B - 


(e) |B| — »• | 

(f) Let a, p and 5 be new symbols. 

P l-l/f 

a| — > \pa 
a. — » A 
||<5 -» |<5a 

l<5-| 

<5|| - <5| 
<51-1 
6 -> | 

P — | 

|B| — > <5 




Listed here are not only books and papers mentioned in the text but also 
some other material that will be helpful in a further study of mathematical 
logic. We shall use the following abbreviations. 

AML for Annals of Mathematical Logic 

AMS for American Mathematical Society 

Arch. for Archiv fiir mathematische Logik and Grundlagenf or seining 
FM for Funclamenta Mathematicae 

HML for Handbook of Mathematical Logic, Springer- Verlag 
HPL for Handbook of Philosophical Logic, Reidel 

JSL for Journal of Symbolic Logic 

MTL for Model-Theoretic Logics , Springer- Verlag 
NDJFL for Notre Dame Journal of Formal Logic 

NH for North-Holland Publishing Company 

ZML for Zeitschrift fur mathematische Logik unci Grundlagen cler 
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Boolean 9 
cylindrical 123 
Lindenbaum 49 
polyadic 123 

Algebraically closed fields 1 1 9 
Algorithm 305, 351 
closed 357 
(fully) equivalent 356 
Markov 352 
normal 352 
over an alphabet 351 
schema 352 
Turing 308 

Algorithmically solvable 328 
a-sequence 286 

Alphabet of a Turing machine 305 

Alternative denial 29 

Analysis, nonstandard 136 

And 11 

Antecedent 1 2 

Applicable 308, 351 

AR2 373 

Argument strip 319 
Arguments, logically correct 26 
Arguments of a function 7 
Arithmetic, langauge of 154 
Arithmetical 
hierarchy 333 
relation 1 90 
set 217 


Arithmetization 190, 192, 349 
Arrows in diagrams 311 
Associativity 

of conjunction, disjunction 23 
Atom 297 
Atomic formula 52 
Auxiliary letter 345 
Axiom 34 

of choice 9, 275 

of class existence 230 

comprehension scheme 291, 294 

extensionality 290, 294 

finite, of choice 277 

Fundierungs- 279 

of infinity 239, 288, 292 

logical 69 

multiplicative 275 

null set 228, 288 

pairing 228, 288 

power set 236, 288 

proper (nonlogical) 69, 70 

of reducibility 293 

of regularity 279, 288 

of replacement 239, 288 

schema 36 

selection 28 8 

set (primitive recursive, recursive) 197 
of subsets 236 
sum set 236, 288 
Axiomatic theory 34, 21 1 
Axiomatizablc 
finitely 94 
recursively 211 

Axiomatization, independent 94 

Basic principle of semantic trees 143 
Bernstein’s theorem 8, 255 
Berry’s paradox 3 
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Beta function of Godel 186 
Biconditional 1 3 

associativity, commutativity 23 
elimination, introduction, 
negation 78 
rules 78 

Binary relation 6 
Blank square 306 
Blatant contradiction 32 
Boolean algebra 9 
Boolean representation theorem 121 
Bounded 
^-operator 
quantifiers 179 
sums and products 178 
Bound occurrence 53 
Bound variables 53 
change of 85 
Branch 1 42 
closed 142 
Brouwer, L.E.J. 4 
Burali-Forti’s paradox 2, 4 

Cantorian ("strongly) 295 
Cantor’s paradox 2, 4, 257, 295 
Cantor’s theorem 2, 257, 295 
Cardinal 

arithmetic 27 1 
Frege-Russel 257 
number 2, 8, 279, 282 
sum 258 

Cartesian product 6, 233 
Categorical 112 
Categoricity of AR2 374 
Category theory 295 
Causal laws 1 2 
Chain 276 

Change of bound variables 85 
Characteristic of a field 117 
Characteristic function 173 
Chinese remainder theorem 190 
Choice 

axiom of, (AC) 9, 275 
denumerable axiom of 280 
finite axiom of 277 
function 275 

principle of dependent 280 
set 9, 275 

universal, function 278 
Church, A. 

Church’s theorem 222 
Church’s thesis 211, 326 


Circuit, electrical 24 
Class 5, 225 
existence axioms 230 
finite 259 

general, existence theorem 232 

ordinal 243 

power 234 

proper 226 

sum 234 

universal 231 

‘Classical’ sense of existential 
quantifier 357 
Clean-up machine (C) 315 
Closed 

normal algorithm 357 
set 140 
term 87 
wf 58 
Closure 

transitive, 280 
(universal), of a formula 61 
Commutativity 

of biconditional, conjunction, 
disjunction 23 

Compactness theorem 93,136 
failure of, in standard second- 
order logic 377 

validity of, for general models 380 
Compatible theories 220 
Complement 23 1 
relative 5 
Complete 
diagram 1 27 
induction 8, 9, 166 
NP- 31 
theory 86 

Completeness theorem 
general second-order 379 
generalized 121 
Godel’s 91 
for L 42 

Composition 7, 241, 357 
normal, of algorithms 358 
Comprehension axiom scheme 291, 
294 

in second-order theories (Comp) 372 
Computable 
a -, 361 

Herbrand-Godel 346 
Markov- 356-7 
standard Turing- 319 
Turing- 309 
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Computation of a Turing machine 308 
Conditional 1 2 
contrapositive 77 
counterfactual 1 2 
elimination, introduction 77 
function (Cond) 1 96 
rules for the 77 
Conjunct 1 1 
Conjunction 1 1 

associativity, commutativity 23 
elimination, introduction 77 
rules 77 

Conjunctive normal form (cnf) 30 
Connected relation 240 
Connective 13, 44 
primitive 35 
principal 14 
Consequence 34 
direct 34 
logical 16, 66 

standard second-order logical 370 
Consequent 12 
Conservative extension 289 
Consistency 72 
absolute 43 
ofL 42 

of a predicate calculus 72 
of S 160, 212 
co- 205 
Constant 

individual 5 1 
nonlogical 57 
(Turing) machine 313 
Continuous 139 
uniformly 141 
Continuum 8 
hypothesis 284 
generalized, hypothesis 284 
Contracted model 100 
Contradiction, proof by 78 
Contradictory 18, 65 
Contrapositive 23, 77 
Correlate 357 
Correspondence, one-one 7 
Countable 8, 261 
Counterfactual conditional 1 2 
Course-of-values recursion 185 
Cowen, R. 32 

Craig’s interpolation theorem 33 
Creative 342 
Cretan ‘paradox’ 2 
Cylindrical algebras 123 


Decidable 

effectively 211 
recursively 216 
theory 34, 362 
wf 169 

Decision problem 361 
Dedekind, R. 1 54 

Dedekind-finite, Dedekind-infinite 26 1 
Deduction 35 
Deduction theorem 

for first-order theories 73 — 4 
for L 37 

Definite description 106 
Definition 
by cases 1 82—3 
of new function letters 
and constants 103 
possible 223 

by transfinite induction 249 
De Morgan’s law 23 
Densely-ordered sets, theory of 98 
Denumerable 8, 261 
axiom of choice 280 
model 90 
sequence 8 

Dependent choice, principle of 280 
Depends 73 

Derivability conditions 213 
Derivable from a set 
of equations 346 
Derived rules 76-7 
Designated values 44 
Detachment rule 35 
Diagonal function 197 
Diagonalization lemma 203 
Diagram (complete) of a model 127 
Diagrams of Turing machines 31 1 
Difference 5, 231 
Direct consequence 34 
Discharged wf of a semantictree 1 42 
Disjoint sets 5 
Disjunct 12 
Disjunction 12 

associativity, commutativity 23 
elimination, introduction 77 
rules 77 

Disjunctive normal form (dnf) 30 
Distributive law 23 
Domain 
empty 147 

of an interpretation 57 
of a relation 6, 231 



INDEX 


Downward Skolem-Lowenheim 
theorem 1 28 

failure of, in standard second- 
order logic 377 

validity of, for general models 380 
Duality 23 
Dummy variables 1 76 
Dyson, V.H. 171 

Easton, W.B. 279 
Effectively computable function 200 
Effectively decidable 211 
Electric circuit 24 
Element 1, 5, 296 
Elementarily equivalent 123 
Elementary 
class of models 1 36 
extension 125 
submodel 125 
substructure 1 25 
theory 98 

theory of groups, fields, ordered 
fields 98 

Elimination of existential 
quantifiers 1 17 
Empty 
domain 147 
function 326 
set 5 
word 306 
Epimenides 2 
Equality 

in second-order languages 369 
in set theory 226 
in type theory 290 
pure first-order theory of 98 
reflexivity of 95 
substitutivity of 95 
theory with 94-5, 99 
Equation 345 
Equinumerous 7, 253 
Equivalence 
class 6 
logical 16 
recursive 343 
relation 6 
theorem 79 
Equivalent 

elementarily, interpretations 123 
(fully), algorithms 3 56 
logically 16, 66 
recursively 343 


Essential incompleteness 211 
Essential recursively 
undecidable 216 
Exclusive ‘or’ 11 
Existential 
quantifier 50 
rule E4 77 

Exponentiation, ordinal 250 
Expressible relation 170 
weakly 344 
Expression 34, 321 
Extension 

of an alphabet 351 
conservative 289 
elementary 125 
finite, of a theory 219 
of a model 124 
submodel 125 
substructure 125 
of a theory 86 
Extensionality 290, 294 
axiom 290, 294 
principle 227 
extremal clause 35 

False 

for an interpretation 60 
for a standard second-order 
interpretation 370 
logically 1 8 

Field of a relation 6, 242 
Fields 

algebraically closed 119 
elementary theory of 98 
ordered 98 
real-closed 362 
Filter 129 

proper, improper, principal 129 
ultra- 1 30 
Finitary 36 
Finite 

axiom of choice 
Dedekind- 261 
e -cycles 279 
extension 219 
intersection property 129 
marriage problem 1 19 
ordinal 259 
presentation 364 
Ramsey theorem 210 
sequence 8 
set 8, 259 
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Finitely 

axiomatizable theory 94 
presented group 366 
First-order language 56 
generalized 1 14 

First-order predicate calculus 70 
First-order vs. second- 
order logic 381—2 
full 221 
pure 22 1 

First-order theory 69 

of densely ordered sets 98 
of equality 98 
with equality 94 5 
generalized 114 
Fixed-point theorem 204 
Fixed-point theorem in recursion 
theory 335 
F-less transform 104 
Follows from 34 
Form, statement 13 
Formal 

number theory 1 54 
theory 18, 34 
Formula 
atomic 52 
well-formed 34, 52 
Fraenkel, A.A. 288 
Free 

occurrence 53 
variable 53 
for Xj in a formula 54 
Frege-Russell cardinal numbers 257 
F-transform 104 

Full first-order predicate calculus 221 
Full general model 379 
Full normal form 31 
Full Second-order language 369 
pure 370 

Fully equivalent algorithms 356 
Function 6, 238 
characteristic 173 
conditional (Cond) 1 96 
definition of new, letters 103 
diagonal 197 
effectively computable 200 
empty 326 
Godel’s beta 1 86 
Herbrand-Godel (HG)- 
computable 346 
initial 174 
into 7 


Function ( continued) 
juxtaposition 182 
letter 5 1 

Markov-computable 3 56-7 
maximum, minimum 177 
negation (Neg) 196 
number-theoretic 1 70 
one-one 7 
onto 7 
partial 7, 309 
partial recursive 318 
predecessor 177 
primitive recursive 175 
projection 174 
quotient 177 
recursive 1 7 5 

recursively completable 328 
remainder 177 
(strongly) representable 171 
successor 174 
total 7, 309 
truth 14—5 

Turing-computable- 309 
variables 368 
zero 174 

Function definition schema 
(Fundef) 372 
Fundierungsaxiom 279 

Gch, see Generalized continuum 
hypothesis 
Gen 70 

General class existence theorem 232 
General model 379 
Full 379 

General recursive 175 
General second-order 
completeness 379 
Generalization (Gen) rule 70 
second-order 372 

Generalized completeness theorem 121 
Generalized continuum hypothesis 284 
Generalized first-order 
language 114 
theory 1 14 
Generally 

implies, is equivalent to 379 
satisfiable, valid 379 
Generators 364 
Godel, K. 

Herbrand-Godel-computable 346 
number 190, 321 
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Rosser theorem 208 -9, 219 
sentence 206 
GodeFs 

/^-function 186 
completeness theorem 91 
incompleteness theorem 206 
second theorem 212, 215 
Graph 118 

Graph of a function 174 
Grelling’s paradox 3 
Groups 

finitely presented 366 
orderable 119 
theory of 71, 98 

Halting problem 328 
self- 329 
special 329 
Hartogs’ function 264 
Hartogs’ theorem 263 
Hausdorff maximal principle 277 
Henkin, L. 

second-order interpretation 378 
second-order semantics 378 
sentence 213 
Henkin’s lemma 380 
Herbrand, J. 345 
Herbrand-Godel-(HG)- 
computable 346 
Heterological 3 
Higher-order 
languages 56 
theories 56, 381 
Hilbert, D. 381 

Bernays derivability conditions 213 
Hilbert’s tenth problem 305, 363 
Hyp 38 
Hypothesis 35 
inductive 8 

Ideal (maximal, proper) 9 
Identifying variables 176 
Identity element 364 
Identity relation 6, 234 
Image 7 
inverse 7 
Immune 343 
Implication, logical 16, 65 
Impredicatively defined set 293 
Improper filter 129 
Inaccessible ordinal 283 
strongly 286 


Inaccessible ordinal ( continued) 
weakly 286 
Inclusion 5, 226 
Inclusive ‘or’ 11 
Inclusively valid 148 
Incompleteness 
essential 211 

Godel-Rosser, theorem 208 
Godel’s theorem 206 
of standard second-order 
semantics 376 
co- 208 

Inconsistent theory 72 
Increasing function 251 
Increasing ordinal a-sequence 286 
Independence 43 
Independent axiomatization 94 
Index 330, 341 
Individual 227, 297 
constants 51 
variables 5 1 
Induction 

complete 8, 9, 166 
mathematical 8 
principle 8, 154-5 
rule 155 

transfinite 9, 245, 248-9 
up to co, up to <5 248 
Inductive hypothesis 8 
Inference, rules of 34 
Infinite 8, 261 
Dedekind- 261 
ordinal 259 
Infinitely dose 138 
Infinitely descending e-sequences 279 
Infinitesimal 136 
Infinity, axiom of 239 
in type theory 292 
Initial 

functions 174 
letter 345 
ordinal 264 
state 307 

tape description 307 
vertex 311 
Inner model 282 
Inseparable, recursively 219 
Instance 61 
Internal state 307 
Interpolation theorem 33 
Interpretable 223 
relatively 224 
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Interpretation 57 

Henkin second-order 378 
standard 160 
standard second-order 370 
Intersection 5, 231, 237 
Intuitionism 4 

Intuitionistic propositional calculus 48 
Inverse 7 
Image 7 

lexicographical ordering 272 
relation 6, 235 
of a word 353 
Iota term 106 
Irreflexive 240 
Isolated 343 
Isomorphic 

interpretations 111 
recursively 342 
Iteration theorem 330 
for models of AR2 374 
Iterative conception of set 282 

Joint denial 29 

Juxtaposition function 181-182 

k-colourable graph 118 
Kleene, S.C. 

-Mostowski hierarchy 333 
Normal form theorem of 326 
Konig’s Unendlichkeitslemma 118 
Kreisel, G, 399 
k-valid 93 

/-computability 361 

L 35 

language 

of arithmetic 1 54 
first-order 56 
generalized first-order 114 
higher-order 56 
meta- 36 
object 36 

law of the excluded middle 4, 16 
Least 

element 9, 245 
number principle 166 
Left 

-end machine 315 
machine 313 
-translation machine 315 
Leibniz, G.W. 65 


Length of an expression 181 
Letter 

auxiliary 345 
function 51 
initial 345 
predicate 5 1 
principal 345 
statement 13, 35 
Liar paradox 2 
Limit ordinal 247 
Lindenbaum, A. 
algebra 49 

Lindenbaum’s lemma 86 
Literal 30 
Lob, M.H. 

Lob’s paradox 3 
Lob’s theorem 214 
Logic 1 

many-valued 44—5 
second-order 368 
third and higher-order 369 
Logical 

axioms 69 
consequence 16, 66 
implication 16 
equivalence 1 6 
paradoxes 3 

standard, consequence 370 
validity 362 
Logically 

correct arguments 26 
equivalent 16, 66 
false 1 8 
imply 16, 65 

standardly second-order, imply 370 
true 1 8 
valid 65 
Logicism 291 
Los’ theorem 133 
Lowenheim, L. 

Downward Skolem-Lowenheim- 
Tarski theorem 128 
Skolem-Lowenheim theorem 92 
Upward Skolem-Lowenheim- 
Tarski theorem 128 

/^-operator (mu-operator) 175 
bounded 179 
unrestricted 318 
Machine, Turing 306 
clean-up 315 
constant 313 
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Machine, Turing (continued) 
left 313 
left-end 315 
left-translation 315 
n-shift copier 316 
right 313 
right-end 314 
shift 3 1 5 

super-universal 332 
universal 3 32 
word-copier 316 
Many- one 
equivalent 343 
reducible 342 
Many-valued logic 44 5 
Maps 7 
Markov, A.A. 
algorithm 352 
-computable 356-7 
Marriage problem 1 1 9 
Mathematical induction 8, 154-5 
Mathematical logic 1, 4 
Maximal ideal 9 
theorem 1 21 
Maximum function 177 
m -categorical 1 1 2 
Mechanical procedure 211 
Member 1, 5 

Membership relation 225, 242 
Metalanguage 36 
Metamathematics 36 
Metaproof, metatheorem 36 
Method of infinite descent 167 
Minimum function 177 
Minimal ( m ax im al) element 263 
ML 296 
Model 60, 70 
contracted 100 
denumerable 90 
(full) general 379 
inner 282 
nonstandard 160 
normal 100 
standard 160 
Modus ponens (MP) 34—5 
Moll, D. 385 

Monadic predicate calculus, pure 222 
Monadic predicate letters 51 
Morse-Kelley set theory (MK) 287 
Mostowski, A. 287 
Kleene-, hierarchy 333 
Moves 307 


MP, see Modus ponens 
Multiplication, ordinal 250 
Multiplicative axiom (Mult) 275 

Natural number 154 
NBG 225 
Negation 1 1 

elimination, introduction 77 
function (Neg) 196 
rules 77 

NF (Quine’s New Foundations) 293 
NFU 296 
Non-class 227 
Nonlogical 
axioms 69—70 
constants 57 
Nonstandard 160, 295 
analysis 136 
model 160, 295 
reals 1 37 
Normal 

algorithm 352 
closed, algorithm 357 
composition 358 
forms 30 
model 100 
prenex, form 106 
Skolem, form 109 
Normal form theorem, Kleene’ s 326 
NP-complete 31 
N-shift copier (K„) 316 

Null set 5 
axiom 228 
Number 

cardinal 2, 8, 279, 282 
of divisors 1 79 
Godel 190, 321 
natural 1 54 
ordinal 243 
Number-theoretic 
function 170 
relation 1 70 
Numeral 160, 345 
Numerical tape description 323 

Object language 36 
Occurrence (free, bound) 53 
Occurs 352 
oj 246 

-consistency 205 
-incompleteness 208 
On 243 
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One-one 7 
correspondence 7 
equivalent 343 
function 238 
reducible 343 
Open 
set 140 
wf 68 

Operation, n-place 7 
Or 11 
Order 
partial 8 
total 9, 242 
type 242 
well- 9 

Orderable group 119 
Ordered 
fields 98 
k-tuple 5, 230 
pair 5, 229 
Ordinal 

a-sequence 286 
addition 249 
class 243 
exponentiation 250 
finite 259 
of first kind 246 
inaccessible 283 
infinite 259 
initial 264 
limit 247 
multiplication 250 
number 243 
regular 286 
singular 286 
strongly inaccessible 286 
successor 246 
weakly inaccessible 286 
Owings, J.C., Jr. 12 

PA, see Peano arithmetic 
Pair 

ordered 5, 229 
unordered 5, 228 
Pairing axiom 228 
Paradox 
Berry’s 3 
Burali-Forti’s 2, 4 
Cantor’s 2, 4 
Cretan 2 
Grelling’s 3 
liar 2 


Paradox ( continued) 

Lob’s 3 
logical 3 
Richard’s 2 
Russell’s 1 , 4 
semantical 3 
Skolem’s 263 

Parameters of a recursion 175 

Parentheses 20, 52 

Partial 

function 7 
order 8, 71, 240 
recursive 318 

Particularization rule A4 76 
Peano arithmetic (PA) 155 
Peano’s postulates 154 
categoricity of 169 
Permutation, recursive 342 
Permuting variables 176 
PF 221 

Poincare, H. 293 
Polish notation 21 
Polyadic algebras 1 23 
Possible definitions 223 
Possible worlds 65 
Post, E.L. 334 
Power 
class 234 
of the continuum 8 
set axiom 236 
PP 221 

Precisely k- valid 9 3 
Predecessor function 177 
Predicate 
calculus 70 
calculus, full 221 
calculus, pure 109, 221 
calculus, pure monadic 222 
letter 5 1 
variables 368 
Predicative wf 232 
Premiss 35 

Prenex normal form 106 
Prenex wf 94 
Presburger arithmetic 169 
Prime number function 181 
Prime number property 180 
Primitive connectives 35 
Primitive recursive 
axiom set 1 97 
function 175 
relation 179 
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Primitive recursive (con tinned) 
vocabulary 192 
Principal 

connective 14 
filter 129 
letter 345 

Principia Mathematica 4, 293 
Principle 

of complete induction 8, 9 
of dependent choices (PDC) 280 
extensionality 227 
least-number 1 66 

of mathematical induction 8, 154—5 
well-ordering 9, 275 
Printing problem 330 
Product 

bounded 178 
Cartesian 6, 233 

Production (simple, terminal) 351-2, 
363 

Productive 343 
- Projection functions 174 
Proof 34—6 
by contradiction 78 
of an equation 346 
Propagation 358 
Proper 

axioms 69-70 
class 226 
filter 1 29 
ideal 9 
inclusion 226 
initial segment 21 
subclass 226 
subset 5 
Property 6, 62 
Proposition 36 
Propositional calculus 1 1 
intuitionistic 48 
Propositional connective 1 3 
Pure 

first-order predicate calculus 109, 221 
first-order theory of equality 98 
full second-order language 370 
monadic predicate calculus 222 

Q 201 

Quadruple of a Turing machine 307 
Quantification theory 50 
Quantifiers 50 
bounded 179 
function and predicate 369 


Quine, W.V. 287, 293, 296, 382 
Quotation marks 13 
Quotient function 177 

R 202 

Ramified type theory 293 
Range 6, 235 
Rank 28 1 

R.e., see Recursively enumerable 
Reading head 306 
Real-close field 362 
Real numbers, nonstandard 1 37 
Recursion 1 74 
course-of- values 185 
theorem 335 
Recursive 
axiom set 197 
function 175 
partial 318 
permutation 342 
relation 1 79 
set 21 1 

vocabulary 1 92 

Recursive, but not primitive recursive 
function 340 
Recursively 
axiomatizable 211 
completable 328 
decidable 216 
enumerable (r.e.) 340 

equivalent 343 
essentially, undecidable 216 
inseparable 219 
solvable 329 
undecidable 21 6 
unsolvable 329 
Reduced direct product 133 
Reducibility, axiom of 293 
Reducible 
one-one 343 
many-one 342 
Reflexive 6 
partial order 8 
total order 9 
Regular ordinal 286 
Regularity axiom 279, 288 
Relation 6, 62, 233 
arithmetical 1 90 
binary 6, 233 
connected 240 
equivalence 6 
expressible 170 
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Relation ( continued ) 
identity 6, 234 
inverse 6, 235 
irreflexive 240 
membership 242 
n-place 6 

number-theoretic 1 70 
primitive recursive 1 79 
recursive 179 
reflexive 6 
symmetric 6 
transitive 6, 240 
universal 335 
weakly expressible 344 
well-ordering 242 
Relations of a finite 
presentation 364 
Relative complement 5 
Relatively interpretable 224 
Relatively prime 1 90 
Relativization 224 
Remainder function 1 77 
Replacement 
axiom 239, 288 
theorem 79 

Representation function 171 
Resolution 32 
Restricted /^-operator 175 
Restriction of a function 7, 238 
Rice’s theorem 336 
Richard’s paradox 2 
Right 

-end machine 314 
machine 313 
Robinson, A. 136 
Robinson, R.M. 

Robinson’s system Q 201 
Rosser, J.B. 

Godel-, theorem 208 9, 219 
sentence 208 
Roy, D.K. 392 
RR 200 
Rule 
A4 76 
C 81-2 
E4 77 

Generalization (Gen) 70 
U 142 

Rules of inference 34 
derived 76-8 
for semantic trees 142 
for systems of equations 346 


Russell, B. 4, 293 
Russell’s paradox 1, 4 

S (first-order arithmetic) 154 
consistency of 160, 212 
Satisfaction relation 60—2 
second-order 369 
Satisfiable 59, 65 
generally 379 

standardly second-order 370 
statement form 3 1 
Scapegoat theory 87 
Scope 52 

Second e-theorem 120 
Second form of transfinite 
induction 248 
Second-order 

general, completeness theorem 379 
generalization rules 372 
language (full) 369 
logic 368 

predicate calculus 373 
semantics 368 
soundness of, logic 373 
Second-order theory 372 
comprehension schema in 372 
function definition schema in 372 
Second-order vs. first-order 
logic 381-2 
Section 242 
Segment 21, 243 
Self-halting problem 329 
Semantic 

paradoxes 3 
trees 141 
Semantical 69, 92 
Semantics, second-order 369 
Henkin 378 
Semigroup 364 
Semi-Thue system 363 
Sentence 
Godel 206 
Henkin 213 
Rosser 208 
undecidable 206 
Sentential class of models 136 
Sequence 
a- 286 

denumerable 8 
finite 8 
Set 1, 5, 226 
arithmetical 2 1 7 
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Set ( continued) 

Cantorian (strongly) 29 5 
closed 140 
countable 8 
creative 342 
Dedekind-finite 261 
Dedekind-infinite 261 
denumerable 8 
effectively decidable 211 
empty (null) 5 
finite 8 
immune 343 

impredicatively defined 293 
infinite 8 
isolated 343 

iterative conception of 282 

open 140 

power 236 

productive 343 

recursive 211 

simple 342 

sum 236 

unit 5 

well-ordered 9 

Set theory with urelements 297 
Sets 

disjoint 5 

recursively inseparable 219 
Shannon, C. 24 
Shift machine 315 
Sierpinski, W. 284 
Similar 

ordered structures 241 
wfs 84 

Similarity mapping 241 
Simple 
f-term 104 
production 352 
set 342 

theory of types (ST) 292 
Singleton 229 
Singular ordinal 286 
Skolem-Lowenheim theorem 92, 101 
Downward 1 28 
Upward 1 28 
Skolem, T. 288, 382 
normal form 109 
Skolem’ s paradox 263 
S-m-n theorem 330 
Solvable 

algorithmically 328 
recursively 329 


Soundness of second-order logic 373 
Special halting problem 329 
ST (simple theory of types) 373 
ST" 296 

ST-co mputable 319 

ST (simple theory of types) 289 

Standard 

interpretation (model) 1 60 
part 1 38 

second-order interpretation 370 
second-order logical 
consequence 370 
Standard semantics, incompleteness 
of 376 

Standard Turing-computable 319 
Standardly (second-order) 
logically imply 370 
satisfiable, valid 370 
State 

initial 307 
internal 307 
valid formulas (SV) 375 
Statement 
form 1 3 
letter 13, 35 
Stops 308 
Stratified wf 294 
Strongly 
Cantorian 295 
inaccessible 286 
representable 1 7 1 
Subclass 226 
proper 226 
Submodel 124 
generated by 125 
Subset 5 
proper 5 

Subsets axiom 236 
Substitution 174 
Substitutivity of equality 95, 288 
Substructure 124 
Subtheory 86 
Successor 1 54, 29 1 
function 1 74 
ordinal 246 

Sufficiently strong theory 212, 224 

Suitable 45 

Sum 

bounded 1 78 
of cardinals 258 
class 234 
set axiom 236 
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Super-universal Turing machine 332 

SV 375 

Symbol 34 

Symmetric 6 

Syntactical 69, 92 

System of equations 345 

Tape 306 
description 307 
description, numerical 323 
representation 309 
symbols 307 
Tarski, A. 

Tarski’s theorem 217 
-Vaught theorem 126 
Tautology 16 

Teichmuller-Tukey lemma 277 
Term 51, 345 
closed 87 

Terminal production 3 52 
Theorem 34 
Theory 71 
axiomatic 34,211 
complete 86 
consistent 72 
decidable 34, 362 
of densely ordered sets 98 
of equality 98 
with equality 94—5, 99 
essentially incomplete 211 
essentially recursively 
undecidable 216 
first-order 69 
formal 18, 34 
generalized first-order 114 
inconsistent 72 
ramified type 293 
recursively axiomatizable 211 
recursively decidable 216 
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second-order 372 
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undecidable 34, 362 
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up to co, up to <5 248 
Transitive 
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relation 6, 240 
Trees, semantic 141 
basic principle of 143 
rules for 142 
Trichotomy (Trich) 275 
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for an interpretation 60 
for a standard second-order 
interpretation 370 
logically 1 8 
theory 205 
Truss, J. 303 
Truth 
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Truth table 11, 14 
abbreviated 14 
Turing, A.M. 305 
algorithm 308 
-computable 309 
-computable, standard 319 
Turing machine 306-7 
alphabet 306-7 
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halting problem 328 
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right, left, constant 313 
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shift 315 
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ramified 293 
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Well-formed formula (wf) 34, 52 
closed 58 
decidable 1 69 
open 68 
predicative 232 
prenex 94 
similar wfs 84 
stratified 294 
Well-ordered set 9 
Well-ordering 9, 240, 242 
principle (WO) 9, 275 
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